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MIXING TIMES FOR A CONSTRAINED ISING PROCESS ON THE 

TORUS AT LOW DENSITY 

NATESH S. PILLAI* AND AARON SMITH# 


Abstract. We study a kinetically constrained Ising process (KCIP) associated with a 
graph G and density parameter p; this process is an interacting particle system with state 
space {0,1} G . The stationary distribution of the KCIP Markov chain is the Binomial(|G|, 
p ) distribution on the number of particles, conditioned on having at least one particle. The 
‘constraint’ in the name of the process refers to the rule that a vertex cannot change its state 
unless it has at least one neighbour in state T’. The KCIP has been proposed by statistical 
physicists as a model for the glass transition, and more recently as a simple algorithm for 
data storage in computer networks. In this note, we study the mixing time of this process 
on the torus G = T, d Ll d > 3, in the low-density regime p = jA- for arbitrary 0 < c < oo; this 
regime is the subject of a conjecture of Aldous and is natural in the context of computer 
networks. Our results provide a counterexample to Aldous’ conjecture, suggest a natural 
modification of the conjecture, and show that this modification is correct up to logarithmic 
factors. The methods developed in this paper also provide a strategy for tackling Aldous’ 
conjecture for other graphs. 


1. Introduction 

The kinetically constrained Ising process (KCIP) refers to a class of interacting particle 
systems introduced by physicists in [FA84, FA85] to study the glass transition. Versions 
of this process have accrued other names since then, including the kinetically constrained 
spin model, the east model [AD02] and the north-east model [Val04]. These models have 
attracted a great deal of interest recently, including applications to combinatorics, computer 
science, and other areas; [CM13, CFM14a] have useful surveys of places that the KCIP has 
appeared outside of the physics literature. Recent mathematical progress has included new 
bounds on the mixing time of the KCIP in various different regimes [KL06, CMRT09, MT13, 
CM13, CFM14b, CFM14a, CFM15]. For a more complete review of recent progress on KCIP 
within the physics community, see the survey [GST 11] and the references therein. 

In this note, we study a simple discrete-time version of this process, though our main 
result applies, after suitable time scaling, to standard continuous-time analogues as well. 
Fix a graph G = (V, E ) and a density parameter 0 < p < 1. For a set S, we denote by 
Unif(S') the uniform distribution on S. Define a reversible Markov chain {A^}^ on the set 
of {0, l}-labcllings of G as follows. To update the chain X t , choose 

v t ~ Unif(V), 

Pt ~ Unif([0,1]). 
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If there exists u E V such that (u,v t ) E E and X t [u] = 1, set X t+ \ [u t ] = 1 if pt < p and 
set X t+ i[v t ] = 0 if p t > p. If no such u E V exists, set X t+ \ [v t ] = X t [vf\. In either case, set 
X t+1 [w] = X t [w] for all w E K\{u t }. 

The state space for the KCIP {A^} feN on a graph G is = {0,1} G . Set \V\ = n: for 
general points x E {0,1,..., |V|} G , write |x| = Let tt denote the stationary 

distribution of {A t } ieN . For y E O, this is given by 

t r(y) = 1 p ]y] (1 - p ) n ~ ly ' l| y |>o, (1-2) 

^>KCIP 

where >Zkcip = 1 — (1 — p) n is the normalizing constant (see formulas (4.1) & (4.2) below). 
Thus 7r(y) is proportional to the Binomial(n,p) distribution on the number of non-zero labels 
in y E O, conditional on having at least one non-zero entry. 

In this paper we study a conjecture stated by David Aldous in [Aid] about the mixing 
time of this process. To state Aldous’ conjecture, we recall some standard notation that will 
be used throughout the paper. For sequences x = x(n),y = y(n) indexed by N, we write 
y = 0(x) for sup n < C < oo and y = o(x) for lim sup, woo ||LL| = 0. Recall that for 
distributions on a common measure space (©,,4), the total variation distance between 
p, and v is given by 

II P ~ HI tv = sup (n(A) - o(A)). 

AgA 

The mixing profile for the KCIP Markov chain {A^}^ on D with stationary distribution tt 
is given by 

r(e) = inf It > 0 : sup ||£(A t ) — 7r|| TV < e} 

1 X q =xGQ J 

for all 0 < e < 1. As usual, the mixing time is defined as r mix = tQ). Aldous’ conjecture is 
[Aid]: 

Conjecture 1 (Aldous). The mixing time r m i x of the constrained Ising process with parame¬ 
ter p on graph G is 0(p~ l \E\ t™), where % s the mixing time of the lazy simple random 
walk on the graph G. 

Although Conjecture 1 is quite general, it was made in the context of studying the KCIP 
on a sequence of graphs {G„} ne N with associated density p = p n = for some fixed 
0 < c < oo ([Aid]). This scaling regime for p n is natural for studying the low-temperature 
limit of the physical process and has been referred to as the natural equilibrium scale [Mar 14]; 
however, its motivation in [Aid] is as a model for data storage in computer networks rather 
than as a model for physical processes. 

For a positive integer L E N, let A (L, d ) denote the d- dimensional torus with n = L d 
points; this is a Cayley graph with vertex set, generating set and edge set given by 

v = z d L , 

Gen = {(1, 0, 0,..., 0), (0,1,0,..., 0),..., (0,0,0,..., 1)}, 

E = j(w, v) E V x V : u — v E ±Gen}. 


n = |A(L, d)\ =L d . 
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Set 





In this paper, we study the KCIP on a sequence of graphs {A(L, d)}Len with density 

P = Pn=~ (1-3) 

n 

for some fixed constant 0 < c < oo and fixed dimension d > 3; see Section 9 for a brief 
explanation of how our method applies when d and c are allowed to vary with L. 

The mixing time of the simple random walk on G = A(L, d) is known to be rffA ~ n z 
(see, e.g., Theorem 5.5 of [LPW09]). Thus Aldous’ conjecture for G = A (L,d) suggests a 
mixing time of r mix = 0(n 2+ d). This is correct for d — 1 (as shown in [AD02]), but we show 
in Theorem 2 below that this conjecture is incorrect for d > 3. 

The following is our main result, in which we prove a modified version of Conjecture 1 for 
the torus: 

Theorem 2 (Mixing of the Constrained Ising Process on the Torus). Fix 0 < c < oo and 
d > 3. For p = p n defined in (1.3), the mixing time of the KCIP on A(L,d) satisfies 

Cyn 3 < r mix < C 2 n 3 log(n) 

for some constants Ci,C 2 that may depend on c,d but are independent of n. 

Remark 1.1. As can seen from Theorem 3 below, the lower bound for the mixing time of 
KCIP obtained in Theorem 2 is valid for any m-regular graph (m > 1) with no triangles. 

In the statement of Theorem 2 and throughout the paper, we assume that both the 
quantities 0 < c < oo and 3 < d G N are fixed; only n grows. In particular, in Theorem 
2 and all other calculations, bounds that are ‘uniform’ are implied to be uniform only in 
n and other explicitly mentioned variables; they will generally not be uniform in c or d. 
Throughout, we will denote by C a generic constant, whose value may change from one 
occurrence to the next, but is independent of n. For u,v G A (L,d), denote by |u — u| the 
smallest number of edges needed to traverse from u to v via a connected path; this is the 
usual graph distance. Finally, let 

Be(v) = {w G A(L, d) : \v — w\ < 1} (1.4) 

be the ball of radius i around v in the graph distance. 

1.1. Relationship to previous work. Although the KCIP we study was introduced in 
the physics literature [FA84, FA85] and discussed in later work such as [Aid], most recent 
mathematical work on mixing bounds for KCIPs has focussed on different local constraints, 
which give rise to qualitatively different behaviour. Thus, even if we studied the same regime, 
our results would not imply (or be implied by) recent work in this area. However, our primary 
contribution to the literature is the fact that our work is in a new regime: we obtain good 
mixing bounds on a KCIP that apply in the regime of high dimension d > 3, low density 
p n ~ n -1 , and under the strong metric || • ||tv- All three of these distinctions can make the 
problem harder than working in dimension d — 1, at high density p n = p, or in a weaker 
metric. 

We briefly review some recent work on the mixing properties of related constrained Ising 
processes [KL06, CMRT09, MT13, CM13, CFM14b, CFM14a, CFM15]. However our results 
in the regime which interests us are novel. We contrast our work with specific papers. Many 
previous results, such as [CM13, CFM15], deal primarily with the regime in which p is a 
constant, independently of n. In this regime, many KCIPs mix relatively quickly and the 
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Figure 1. Simple Random Walk Heuristic 



Figure 2. Particle Creation Heuristic 

obstacles to mixing are quite different. Other results, such as [AD02, GLM], study the 
small-p regime, but only in one dimension. In particular, the methods employed in [AD02] 
completely break down for d > 1 and thus are not applicable to our setting. The recent 
paper [CFM14a] seems most similar to ours. In [CFM14a] the authors study the mixing of a 
related KCIP on A(L, d ) at density p ~ ^ and obtain results in greater generality than ours. 
However, the authors of [CFM14a] focus on bounding the relaxation time of the process, 
rather than the mixing time; the bounds obtained in [CFM14a] cannot be used to obtain 
sharp estimates on the mixing time. 

1.2. Outline for the paper. This paper is largely devoted to the proof of Theorem 2. In 
Section 2, we set up notation and give a proof sketch. In Section 3, we give a general upper 
bound on the mixing time of Markov chains on finite state spaces; the rest of the proof of 
the upper bound in Theorem 2 consists of estimating the constants in this general bound. In 
Section 4, we give the lower bound in Theorem 2. Sections 5 to 7 contain most of the work 
required to prove the upper bound in Theorem 2. In Section 5, we study the behaviour of 
the KCIP at low density by comparing it to the simple exclusion process. In Sections 6 and 
7, we detail the behaviour of the KCIP at high density by comparing it to the coalescence 
process. In Section 8, we combine the results obtained in earlier sections and give the proof 
of Theorem 2. Finally, in Section 9 we discuss related problems and the extent to which our 
methods apply to them. 


2. A Roadmap for the Proof 

We first explain the heuristic arguments that make Conjecture 1 plausible and point out 
the key time scales involved. For the reader’s convenience, we also give a proof sketch 
highlighting all the major steps involved in the proof of Theorem 2. 

2.1. Heuristics and key time scales. We discuss heuristics for the time scales on which 
important changes to the KCIP occur. 

(1) If the initial state A"o has a large number of particles, the number of particles re¬ 
maining will generally be cut in half every 0(n 3 ) steps. This observation is crucial 
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to our proof. We show this bound using closely related bounds on the ‘coalescence 
time’ of a collection of random walkers on a graph (see [Cox89]). This bound implies 
that, after an initial transient period of at most 0(n 3 log(n)) steps, the number of 
particles in the KCIP will be 0(1). This is the only place we obtain an extra logn 
factor in our proof of the upper bound of Theorem 2. We suspect that, in fact, this 
transient period is only of total length 0(n 3 ). 

(2) After the initial transient period, the KCIP generally has 0(1) well-separated parti¬ 
cles. Every 0(n 2 ) steps, a particle will spawn a neighbour; one of these neighboring 
particles will be removed in 0(n) steps. Ignoring the times at which any particle has 
a neighbour, the 0(1) well-separated particles will appear to be evolving according to 
an independent random walk on A(L, d), slowed down by a factor of roughly n 2 ; see 
Figure 1. Since all particles have no neighbours at most times under the stationary 
measure, ignoring the times at which any particle has a neighbour does not greatly 
influence one’s view of the process. 

(3) After the initial transient period, and again ignoring the times at which any particle 
has a neighbour, it often takes 0(n 3 ) steps to decrease the number of particles by 1. 
To see this, note that the number of particles can only decrease when two existing 
particles ‘collide.’ Recall that the expected collision time for two random walkers on 
the torus is 0(n). By the above heuristic, the particles in the KCIP are undergoing 
simple random walk that is slowed down by a factor of n 2 ; thus, the expected time 
to a collision is 0{n 3 ). It also turns out that collisions ‘often’ result in the number 
of particles being decreased. 

(4) After the initial transient period, and again ignoring the times at which any particle 
has a neighbour, it takes 0[n 3 ) steps to increase the number of particles by 1. Indeed, 
increasing the number of non-adjacent particles by 1 requires an intermediate time 
at which three particles are ‘touching’ in A (L, d ), where two of them share the same 
neighbor. It takes 0(n 2 ) steps for a particle to spawn a neighbouring particle, and 
whenever two particles are adjacent, the probability of one of these two particles 
being removed before a third ‘touching’ particle is added is 1 — 0(n _1 ). Thus, it 
takes 0(n 3 ) steps to obtain three ‘touching’ particles; see Figure 2 for a generic 
illustration of how this happens. It is easy to check that a configuration with three 
‘touching’ particles often degenerates into one with two non-adjacent particles after 
0(n) steps. 

The basis of Conjecture 1 is heuristic (2) above: individual particles at distance greater than 
one in the KCIP on A (L,d) tend to behave like independent random walkers on A (L,d), 
slowed down by a factor of roughly n 2 . Thus, for density p = -, we might expect the KCIP 
to have roughly c particles during most times and to behave quite similarly to the simple 
exclusion process (SEP) with c particles (see [CS73, HL75] for an introduction). Heuristics 
(1) and (4) explain why Conjecture 1 is not telling the whole story: heuristic (1) points out 
that it takes a long time to go from n particles to 0(1) particles, while heuristic (4) points 
out that it takes a long time to go from 1 particle to 2 well-separated particles. The lower 
bound in Theorem 2 is obtained by making heuristic (4) rigorous. 

An obvious modification to Conjecture 1 is that the mixing time of the KCIP is at most 
the maximum of these three time scales, and this is the approach we take in this paper. 
There are essentially three obstacles to making this approach rigorous. The Erst is to deal 
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with the fact that we would like to compare a single KCIP to many different SEPs - the SEP 
with c particles, but also the SEP with c — 1 particles, c + 1 particles, etc. Breaking apart 
the KCIP in this way is the subject of Step 1 and Step 4 in the proof sketch below. The 
second is to complete the comparison of the EP to a suitably tamed version of the KCIP. 
This is the subject of Step 2 below. The third is to ensure that a KCIP started with a large 
number (3> c) particles quickly enters a state with roughly c particles. This corresponds to 
Step 3 below. 

2.2. Proof sketch. Our methods in some of the steps in the following outline are applicable 
for a generic graph G , and the others are specific to the torus A (L,d). For 1 < k < |, let 
flfc C O be configurations of k particles for which no two particles are adjacent, i.e., 

n fc = {Xe{0,l} G : ^ 2 x [v] = k, Y X[u]X[v] = 0 }. (2.1) 

v£V (u,v)^E 

— / n\ 

Also set O' = fl\ U 2 =1 Q* ; . For each k < n, we will denote by 7 W the mixing time of X t 
‘restricted to’ (our notion of ‘restriction’ is defined more carefully in Section 5). Define 
the quantity 

T 

OcCfc(e, N) = sup inf > 1 : X 0 = x, P(^ lx s ef 4 > N ) > 1 - e 
x&n s=0 

For a fixed N and small e, Occfc(e, N) denotes the first (random) time at which the occupation 
measure of X t in Q k exceeds N with high (1 — e) probability. For x G define the exit 
time: 

L k (x) = inf {t : X 0 = x, X t G U(2.2) 
Our proof strategy for the upper bound in Theorem 2 entails the following steps. 

Step 1. We show that for a universal constant k max = /c max (G, c) depending only on the 
local structure of the graph G and the constant c from (1.3), 

r mix = 0( sup T^l + sup OcCfc (—^ , Ct^ x ) ) . 

This is an immediate consequence of Lemma 3.1. 

Step 2. By a comparison argument using the simple exclusion process, we show that t W = 
^(( a rul)^ 1 ulogn) uniformly in 1 < k < fc max , where af^ k is the log-Sobolev constant 
of the simple exclusion process on G with k particles. For G = A (L,d), it is known 
that (a^) -1 = 0(n 1+ §) (see [Yau97]) and thus t^ x = 0(n 2+ d logn) uniformly in 
1 < k < k mSLX for the KCIP on A(L, d ). See Lemma 5.1. 

Step 3. For G = A(L,d), by coupling the KCIP to a ‘colored’ version of the coalesence 
process over short time periods, we show that the process 

U = £>,[«] (2.3) 

v&V 

satisfies the ‘drift condition’ 

nv t+en s - V t \X t ] < -SVt + c 

for some C < oo and e, 5 > 0, all independent of n. See Theorem 6.1. 
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Step 4. For G = A (L,d), by direct calculation and further comparison to a coalesence 

process, we show that 

inf P \Lk(x) > Cin 3 ] > e > 0 

and 

sup P [Lk(x) < C 2 n 3 ] > e > 0 

for some constants C±, C 2 , e > 0 independent of n. See Lemmas 7.3 and 7.6. 

Step 5. Conclude from Step 3 and Step 4 that sup 1<fc<fcmax OcCfc(gp—, Ct^ x ) = 0(n 3 log(n)). 

See Lemma 8.1. 

The lower bound is obtained in Theorem 2 by a direct computation of the expected time 
for the KCIP { X t } on A(L, d) to hrst have two non-adjacent particles when started from a 
single particle. Some of the steps above that are specific to the torus A (L, d ) can be extended 
for other graphs as well, as discussed in Section 9. 

3. General Mixing Bounds for Decomposable Markov Chains 

In this section, we give a general bound on the mixing time of decomposable Markov 
chains. This result will be later applied to the KCIP Markov chain to achieve Step 1 in 
the proof of Theorem 2. The bounds in this section apply to Markov chains other than the 
KCIP and thus are of independent interest; they are developed further and applied to other 
examples in the forthcoming note [PS]. 

Consider an aperiodic, irreducible, reversible and |-lazy Markov chain with tran¬ 

sition matrix K and stationary distribution n on a finite state space 0. Our goal is to bound 
the mixing time of in terms of the mixing times of various restricted and projected 

chains; these L 1 mixing bounds are loosely analogous to the L 2 bounds in [JSTV04], We 
begin by fixing n and writing the partition 0 = U£ = 1 0fc. For 1 < k < n, set 77 *.(0) = 0 and 
for sfM, recursively define the sequences of times 

Vk(s) = mf{i > 7 ] k (s - 1) : Z t e @ fc }, (3.1) 

tc fc(s) = sup {77 : r] k (u) < s}. 

The quantity tc*, can also be written as 

T 

Kk ( T ) = ^2 1 z t e© fe - (3.2) 

t=i 

Both 77 , ft depend on the initial condition Z\. We also define the associated restricted pro¬ 
cesses by 

z[ k) =z m w . (3.3) 

This process is called the trace of {Z t } te¥S on 0 fe . Since {Z t } teN is recurrent, we have for all 
t E N that rjk(t) < 00 almost surely, and so Z^ is almost surely well-defined for all t e N. 
The process {Z^} te ^ is a Markov chain on 0 fc , and denote by K k the associated transition 
kernel on 0*.. The kernel K k inherits aperiodicity, irreducibility, reversibility and ^-laziness 

from K and its stationary distribution is given by 7 r k (A) = ^ or ^ c ©fe- We denote 
by (fk the mixing time of the kernel K k - 
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For 0 < a < |, define the universal constants c a and c' a as in Theorem 1.1 of [PS13]. We 
then have the following simple bound on the mixing time of K: 

Lemma 3.1 (Basic Mixing Bound). Fix 0 < a < \ and 1 — a < f3 < 1. Define 7 = 
min Q, > 0 and fix a collection of indices I C {1, 2 ,,n} satisfying 

^7r(0 fc ) > p. 
kei 

Then the mixing time of {Z t } teN satisfies 

^ < T = inf It : inf sup (^ + supP JkJT) < t]) < U. 

4c a l 0 <t<T keI V t 2 ge / 4i 

Thus Lemma 3.1 relates the mixing time of a Markov chain to the mixing times of its traces 
on subsets of the state space with reasonably large stationary measure and the corresponding 
occupation times on those subsets. This is possible due to the remarkable results obtained in 
[PS13, 01il2a], where the authors obtain a bound on the mixing times of reversible Markov 
chains in terms of hitting times. 

Remark 3.2. Since our mixing bounds in Section 5 below are obtained by a bound on the 
log-Sobolev constant of various restricted chains, and our bound on occupation measure can 
easily be translated into a bound on the spectral gap of an associated projected chain, the 
reader may ask why we do not use the well developed theory in [JSTV04] for decomposable 
Markov chains to bound the mixing of a Markov chain in terms of restricted and projected 
chains. One reason is convenience: unlike the bounds in [JSTV04], Lemma 3.1 requires only 
a bound on for some k, not all k. Since the bounds in Section 5 apply only for k small, 
Lemma 3.1 allows us to avoid doing the substantial extra work of finding bounds on ip k for 
k large. The second, and more important, reason is that it is impossible to get an upper 
bound on the mixing time that is smaller than 0(n 4 log(n)) using bounds from [JSTV04] 
and any partition of similar to the partition that we use. The extra factor of n comes 
primarily from the fact that the probability of moving from Tl k +i to f4 within 0(n 2 ) steps 
is very far from uniform over starting points x £ f4+i- 

4. Lower bound for the mixing time of KCIP on A(L, d) 

In this section, we give a direct computation that leads to a lower bound on the mixing 
time of the KCIP on any m-regular graph G = (V, E) that contains no triangles. The torus 
A(L, d) is 2d-regular and triangle-free, and this bound will immediately give the lower bound 
in Theorem 2. Before proceeding, we verify that the stationary distribution 7 r is given by 
formula (1.2). If P[X t+1 = y\X t — x] > 0, then either 

P[X m = y\X t = x} = -P—F[X t+1 = x\X t = y] (4.1) 

1 — p 

= !W t+1 = x \X t = y] 

7T(X) 

or 

P[W +1 = V\x t = x\= l —^nx t+1 = x\X t = X] (4.2) 








*(y) 

7r(x) 


P[AVi = x\X t = y], 


depending on whether G x[v\ < gJ/M or n °L Thus the Markov chain 
satisfies the detailed balance equation with respect to tt and thus has tt as its stationary 
distribution. 

We continue by setting notation that will be used throughout the remainder of the paper. 
Let G t be the subgraph of G induced by the vertices \v G G : X t [v\ = 1}, with vertices 
V(G t ) and edges E(G t ). For a vertex u G A (L,d), define Comp t (u) to be the collection 
of vertices contained in the same connected component as u in G t , define Comp t to be the 
collection of distinct connected components in G t , and let 

Y t = | Comp t | 

be the number of connected components in G t . Recall that V t = YlveG^t M = \G t \ is the 
number of vertices having state 1. For a KCIP started at time 0 with Vo — 1, define the 
associated triple time by 

Ctripie = inf{t : Vt > 3}. (4.3) 

Lemma 4.1 (Component Growth). Let G be an m-regular graph (m > 1) with no triangles. 
Fix e > 0 and assume that V 0 = 1. Then 

^3 


P [(triple < 


n 


: 1 = 0 (‘). 


(4.4) 


3 c 2 m(m — 1) ■ 

where the implied constant is uniform over 0 < e < eo sufficiently small and does not depend 
on G or c. 

Proof. Define the matrix 


K =U( 1 _£) 1 

l n v n / 

o 


c(2m— 1) 

- 2 - 


n v n J 


o 


c(2m— 1) 

V 


(4.5) 


It is straightforward to check that, for 0 < s < (triple and a G {1, 2, 3}, 

F[V s+1 = a\X s } = K[V s ,a\. (4.6) 

Most significantly, X s appears on the right-hand side only through V s . We justify this by 
considering the various cases. If V s = 1, the transition probabilities P[W+i = i\Xf\ depend 
only on the number of vertices labelled 1 at time s (this is 1 ) and the number of vertices 
adjacent to this vertex (since our graph is m-regular, this is m). If V s = 2, the transition 
probabilities P[W+i = i\X s ] depend only on the number of vertices labelled 1 at time s (this 
is 2 ), the number of vertices labelled 1 adjacent to other vertices labelled 1 (this is also 2 ) 
and the number of vertices labelled 0 adjacent to vertices labelled 1 (since our graph is both 
m-regular and triangle-free, this is 2m — l ). 1 

By direct computation, 

El [(triple] = (-l) + [(triple] 

v cm 


lr The transition probability P[D_(_i = i\V s = 3] is independent of X s only if G contains no 4-cycles; this is 
of course not the case for our graphs of interest. 
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(-(1 - £ ) + c(2m 2 1) )E 2 [C Mp i.] = 1 + -(1 - -WCwpJ, 


n 


n 




n 


n 


and so we obtain 


El [Ctripie] — 


n 


+ 0(n 2 ). 


(4.7) 


c 2 m(m — 1) 

We recall from [KM59] (and in particular the rewriting and generalization of that theory in 
[MiclO]) that the distribution of Ctripie is given by a weighted sum of two geometric distribu¬ 
tions. Thus, denoting Geom(g) for the geometric distribution with mean q, we have 


Ctripie = aiGeom(0i) + a 2 Geom(0 2 ), 


(4.8) 


where by inequality (4.7), 


a\9i + q - 2^2 — 


n 


+ 0(n 2 ). 


c 2 m(m — 1) 

3 

This implies that, for some j G {1,2} and all n sufficiently large, cijdj > 3c 2 m " m _ 1 ) ■ 


Assume 


without loss of generality that j — 1. We then have 

i3 


P[ai0i < 


n 


3 c 2 m(m — 1) 

Combining this with formula (4.8) completes the proof. 
We now conclude with the lower bound in Theorem 2: 


]=°(4 
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Theorem 3 (Lower Bound on Mixing Time). Fix an m-regular (m > 1) graph G with no 
triangles. Then the KCIP on G with success probability p = - has mixing time satisfying 


T„ 


> C 


n 


Z c m(m — 1) 


where C does not depend on c or G and 

c 2 

Z r = 


<r max(l, — log 2 ((c + y) yz^)) 


(*+t) 


> o. 


Proof. Fix e > 0, let T = J an< ^ define the set A = {a: G Q : ^2 veG x[v] < 2}. Let 

X 0 be such that Vo = 1. From Lemma 4.1 and Equation 1.2 for the stationary distribution 
7r, we calculate: 

IT(At’) — 7t||tv ^ 7t(A c ) — IPpiy G A c ] 

> 7t(A c ) - P[r tri pie < T] 


c , 


= 1 — n— (1-) 


n 

„2 


n —1 _ 

n' 1 

— C 


n(n — 1) c 2 


(1 


c\n 


n* 


d-d 


n —2 


n 


( 1 -:)” 


+ 0(e) 


— ( c + y) + 0 (!>) + 0(e). 


Thus, for e |(c + il e -c sufficiently small, 


||£(X t ) -7 t||tv > 2 ( c + ~2~) l_ e ~ c 
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> 0 


















uniformly in n > N(e) sufficiently large. Since the mixing profile satisfies 

r (^( c + ~ max ( 1 >- 1 °g 2 (2(c+ y)j^^))r mix , 

by Lemmas 4.11 and 4.12 of [LPW09], this implies 

T 

max(l, - log 2 (2(c + f ) T z^r)) 

for e sufficiently small. This completes the proof. □ 

5. Mixing times of the trace of KCIP on Q k 

In this section, we obtain bounds on the mixing times of the trace of the KCIP on Q k - 
As these trace walks mix substantially more quickly than the KCIP Markov chain X t on Q, 
these mixing time bounds do not need to be tight for arguments to go through. 

Fix 1 < k < |, and let Q n ,k be the kernel of the trace of {W}^ on Vt k (recall that the 
trace of a Markov chain on a subset of its state space is defined in formula (3.3)). Denote 
by T n ,k the mixing time of Q n ,k■ The key result of this section is: 

Lemma 5.1 (Mixing Time of Restricted Walks). With the notation as above, 

r n ,k < Cn 2+ % log(n) 

for some constant C — C(c, k, d ) that does not depend on n. 

We proceed by using comparison theory developed for proving useful mixing bounds for 
a Markov chain by comparing its transition rates to a similar and better-understood chain 
(see, e.g., [DSC93] or [DGJM06] for an introduction to this method). We will use the simple 
exclusion process (SE) on A(L, d) as the basis of our comparison argument (see [CS73, HL75] 
for an introduction to the simple exclusion process). 

Definition 5.2 (Simple Exclusion Process on A (L, d)). The simple exclusion process {Zf} te pj 
is a Markov chain on the finite state space 

n^ = {ze{o,i}»: = (5-1) 

i 

To update Z t , choose two adjacent vertices u t ,v t G A (L,d) uniformly at random and set 

Z t+1 [u t \ = Z t [v t \, 

Zt+i[ v t\ = Z t [ut\ 

and Z t+1 [w\ = Z t [w ] for w £ {u t ,v t }. 

The approach in this section is to first note that the simple exclusion process with k 
particles has good mixing properties (we use the results in [Yau97], though others would 
suffice for our purposes) and then use a comparison argument to show that the mixing 
properties of the trace of the KCIP on cannot be much worse. We use the simple 
exclusion process because it makes both parts of this argument easy: it has already been 
carefully analyzed, and it is similar enough to the trace of the KCIP that the comparison 
argument is short and involves only soft arguments. It is likely that the conclusions we need 
can be achieved by comparison to other processes on the torus. 

ll 






5.1. Comparison of Markov chains using Dirichlet forms. Before stating the main 
result of this section carefully, for the reader’s convenience, we recall some relevant results 
for comparing Dirichlet forms. We use the bounds in [Smil4], rather than the similar and 
simpler results from [DSC93, DGJM06], because we will compare chains (KCIP and SE) with 
different state spaces; the bounds in [DSC93, DGJM06] cannot be used in this situation. 


Definition 5.3 (Norms, Forms and Related Functions). For a general Markov chain on a 
finite state space X with kernel P and unique stationary distribution 7 r, and any function 
/ : X —> R that is not identically 0, we respectively define the L 2 norm, variance, Dirichlet 
form and entropy form as: 


\\f\\l,n = \f ( X )\ 2n ( X ), 

xex 


v Af ) = \ l^( x ) - f(y)\M x )n(y), 

x,y£X 

£p{fJ) = \ I/O) - f(y)\ 2 p ( x ,vM x ), 

x,y£X 


L Af) = J 2 I/O)| 2 !og( 

xex 



(5.2) 


Recall that the log-Sobolev constant of a Markov transition matrix P is given by 

(TD\ ■ C £ P(fJ) f K o\ 

a(p)= }‘i f oX(7r (5 - 3) 

Definition 5.4 (Extensions). Let K , Q be the kernels of two |-lazy, aperiodic, irreducible, 
reversible Markov chains. Assume that K has stationary measure // on a state space 0 while 
Q has stationary measure v on a state space 0 C 0. Denote by / a function on 0, and call 
a function / on 0 an extension of / if f{x) = f(x) for all 16 ©. 


Next, fix a family of probability measures {P x [y]} TG g on 0 that satisfy P x (-) = S x (-) for 
x G 0. We will use only extensions of the form 

f(x) = J2 v MPy'>- < 5 - 4 ) 

We call extensions of the form (5.4) linear extensions. 

Fix a linear extension. For each pair (x,y) e 0 with K(x,y ) > 0, fix a joint probability 
distribution P x ^[a, b] on 0 x 0 satisfying ^x,y[ a i b] = IPj, [6] for all b G 0 and P x,y[ a , b] = 
P x [a] for all a G 0. This is a coupling of the distributions P a ,, P y . 

Definition 5.5 (Paths, Flows). Finally, for each a, b G 0 with y& e b] > 0, 

we define a flow in 0 from a to b. To do so, call a sequence of vertices 7 = [a = 
Vo,a,b,vi,a,b, ■ ■ ■ Wfc[ 7 ],a,fe = b\ & path from a to b if Q{v^ a ,b,v i+ > 0 for all 0 < i < k[ 7]. 
Then let r a y be the collection of all paths from a to b. Call a function F from paths to [0,1] 
a flow if X) 7 er b -P’W = 1 f° r ad a, b. For a path 7 e r a) b, we will label its initial and final 
vertices by i( 7) = a, 0(7) = b. 
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The purpose of these definitions is to provide a way to compare the functionals described 
in formula (5.2). If there exists a family of measures {Px}^© so that the associated linear 
extensions given by formula (5.4) satisfy 

L u (f)<C L L,(f), 

S K (f,f)<C £ S Q (fJ), 

then the variational characterization of a given in formula (5.3) implies 

a(Q) > yrrr a (K). (5.5) 

This is the motivation for Theorem 4 and Lemma 2 of [Smil4], Theorem 4 of [Smil4] may 
be restated as: 


Theorem 4 (Comparison of Dirichlet Forms for General Chains). Let K,Q be the kernels 
of two reversible Markov chains. Assume that K has stationary measure p on state space 0 
while Q has stationary measure v on state space 0 C 0. Fix flow F, distributions P x and 
couplings F X:V as in the notation in Definition 5.f above. Then for any function f on 0 and 
the linear extension f of f on 0 given by formula (5.4) ; 

S K (f,f)<AS Q (fJ), 


where 
A = sup 


Q(q,r)>oQ(q,r)v(q) 


F b] k b] K ( i (.'r),o('y))p(i('y)) 

7 3(q,r) 

+ 2 k b) F b) P J°( 7 )] A ^( 7 ),y)M*( 7 )) 

7 9(9,0 jG§\e 

+ k b\ F b\ W Xt y[i('y),o('y)]K(x,y)fj l (xf). 

7 3(<l,r) x,yeQ\e:K(x,y)>0 


Lemma 2 of [Smil4] may be restated as: 

Lemma 5.6 (Comparison of Variance and Log-Sobolev Constants). Let p be a measure on 
0 and v be a measure on 0 C 0. Let C = sup yg0 AA. Then for any function f on 0 and 

linear extension f of f on 0, 

W) < CVM, 

Lu{f) < CL,(f). 

5.2. Bounds on KCIP. Next we prove our results. Fix 1 < k < | and denote by a U} k the 
log-Sobolev constant of Q n ,k- Denote by Qffj. and the kernel and log-Sobolev constant 
associated with the simple exclusion process with k particles on A (L,d). We then have: 


Lemma 5.7 (Comparison of Log-Sobolev Constants). There exist N = N(c,k,d ) < oo and 
C = C(c, k, d) < oo so that n > N implies 


&n,k ' :> ( & 

n 


SE 
n,k ' 
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(5.6) 




Proof. Our proof consists of comparing a sequence of very similar Markov chains, beginning 
with the trace of the KCIP and ending with the simple exclusion process. The bulk of our 
argument goes through Theorem 4, Lemma 5.6 and inequality (5.5). 

Recall that the state space of Q U)k is Q ntk = tt k as defined in formula (2.1), while the 
state space of Q^ k is as defined in formula (5.1). By the standard ‘birthday problem’ 
computation, 


IT 1 

i =1 


-g (2 d+ 1) ^ < 


n 


l«l 


< 1. 


Thus, for any fixed fceN, 

1 - o(l) < |§S#J < 1 (5.7) 

as n goes to infinity. Since the stationary distributions 7r n fc and 7of Q U)k and Q^ k 
respectively are uniform on Q n , k and respectively, inequality (5.7) implies that 


1 < 



< l + o(l) 


(5.8) 


uniformly in x G £l n> k C . 

Next, we define a less-lazy version of Q n , k . Note that 


Qn,k ( x , x ) > 1 - 5 -, 

n 

and so for n sufficiently large we can define a less-lazy version Q^ k of Q n .k by 

Qn.k = (1 - “)M + -QS, 

n n 

where Id is the identity kernel and Qff k is at least |-lazy itself. Since \ is simply a less lazy 
version of Q n , k , it is immediate that the associated Dirichlet forms £qnl and P-Q nk satisfy 

for all / : Q n)k —> M with / 7 ^ 0. Applying Lemma 5.6 to inequality (5.5), this bound implies 
the following inequality for the log-Sobolev constant of Qn\- 

«n,fc > (5.9) 

n 

Thus, to prove inequality (5.6) it is sufficient to relate Q^j. and . 

Next, we define to be the Metropolis-Hasting chain with proposal chain equal to 

Q® E and target measure n ntk = Unif(h2 nifc ). If x 7 ^ y G PL n , k satisfy Q^ -SE (x,r/) > 0, then 
x and y differ at exactly two vertices u,v, with x[u\ = y[u\ = 0, x[u\ = y[v\ = 1. Let <f >0 = 0 

and inductively let d> l+ i = inf(7 > </>; : Ah 7 ^ X^} be the successive times at which the 

KCIP changes; we calculate 

Tl 

Qn L k( x ,y) > —Q n . :k (x,y) (5.10) 

c 

Tl 

> -P[A 01 [u] = 1\X 0 = x]F[Xfa[u\ = 0|Aq = x,X^[v] = 1] 
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> 


n c £0- - f) 


c ri 2 Ifl _ £1 a. 

rcA n' 

n — c 


2cd{k+l) 

T) 2 




n K n — c + 2 cd(k + 1) 


4( 


n 


2 v n — c + 2 cd(k + 1) 


) 

)«r E (^y). 


To see the second and third lines, consider starting at KCIP at X 0 and calculating the 
probability that X ( p l is obtained from X 0 by changing the label of v from 0 to 1, and that 
Xfo is obtained from X^ by changing the label of u from 1 to 0; these probabilities are at 

least 


-^2 and ! - ” '; 2 ed(fc+i) respectively. By the same short argument immediately preceding 

n ' n'' 

inequality (5.9), inequality (5.10) implies that for all n sufficiently large the log-Sobolev 


constant ay™ SE of Q SE satisfies 


... . 1 , 

^ n,k — n,k 


(5.11) 


In light of this inequality and inequality (5.9), to prove inequality (5.6) it is enough to relate 
<2“fc" SE and Q™. 

We now give the main comparison argument, using Theorem 4. Using the notation of 
Theorem 4, we will compare kernels Q = ~ SE and K = Q^ k on state spaces 0 = 

and 0 = D^ E . Both of these kernels have stationary distributions that are uniform on their 
respective state spaces. To define the flows, distributions and couplings required by Theorem 
4, we need slightly more notation. Define the graphs G^ E ~ SE and G,® E to have vertices 


and edges 


V{G: 


X SE ) = 


ncffi = 


n,ki 

SE 


E(G. 


MH-SEn 
n,k ) 

E(Gl E k ) 


{(x,y)eG^~ SE : g^- SE (x,r/)>0}, 
{(x,y) e G s n E k : Ql E k (x,y ) > 0}. 


We denote by d MH SE and d SE the usual graph distances on G^ E SE and G® E respectively. 
Next, define the distributions {Px}x e nSE fc on ^n,fc by P x [-] = <5 X (-) if x e and 

Pi[-] = Unif({y e tt n ,k ■ d SE (x,y) = min d SE (x, 2 )}) 

z ^n,k 

otherwise. We define the couplings {P®,j/} X! y e n SE ,qse ,< >0 to be the independent couplings 
P x , y [a,b\ = P x [a]Py[6]. Finally, for pairs (a, b) satisfying Ex,yen SE ,Q SE (x, y )>o ^x, y [ a i we 

define the flow F on T 0j & to be uniform on all minimum-length paths in (7 MH ~ SE from a to b. 

We now show that the constant A that the above choices yield is uniformly bounded in 
n. Recall that jSj = 1 + o(l) and Q i s either 0 or 1 when it is defined, and so the 
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constant A can be bounded by 

•4=(i + °(i)) C Y F b\ k b} + 2 Y k b} p b} Y P !/[ 0 W] 

Q( q ,r)>° 7 3(q,r) 7 3(q,r) ye§\0 

+ Y k b] p b] Y p ^(xh°(7)])- 

7 3{q,r) x,yeQ\e:K(x,y)>0 

To show that A is uniformly bounded in n, it is enough to check that all of the probabilities, 
couplings and paths that we have defined are local in the sense that any particular probability, 
coupling or path involve only points that are a bounded distance from each other, uniformly 
in n. Since the details of the bounds are not important to us, we give very loose bounds. 

P x is concentrated on points y with d SE (x,y ) < 2k 2 . ¥ Xty is defined only on pairs (x, y) 
satisfying d SE (x,y ) = 1 and thus is supported on pairs (a, b) satisfying 

d SE (a, b) < Ak 2 + 1. (5.12) 

For x,y G Q n>k , let 

C XtV = max{|n — v\ : u, v G A(L, d), x[u] + y[u] > 1, x[v\ + y[v\ > 1} 
be the maximum distance between any particles in x or y. Set 

R c , n = max{d MH ~ SE (x, y) : C x , y < C}. 

Since ( 7 MH_SE is vertex-transitive and is connected for n sufficiently large, we have for all 
n > N(C, k, d) sufficiently large that 

Rc,n < n c 

for some constant 7 Zc that does not depend on n. Thus, for all n > N(C,k,d ) sufficiently 
large and all x, y with C x<y < C , 

d Mn ~ SE {x,y) <U C . (5.13) 

If d SE (x, y) < C , then there exist k vertices Vi,... , v k G A(L, d) that cover the particles of x 
and y: 

{v G A (L,d) : x[v\ + y[v] > 1} C U k i=l B 2 c(yi). 

By taking larger balls, this can be turned into a disjoint cover: there exist 1 < m < k vertices 
..., y^rn so that 

{v e A (L,d) : x[v) +y[v] > 1} c yJ^ = i B ^{c+n c ){ u i) 

with B 3 k(c+TZ c )( u i ) FI B3k(c+Tic)(. u j) = 0 f° r i 7 ^ .?• By the definition of 7 Zc, no minimal 
path from x to y can have any particles outside of the cover U™ j B: ik (c+nc)( u i)'i thus, by 
inequality (5.13), for all x,y G Q n .k with d SE (x,y ) < C we have that 

d MU ~ SE (x,y) < Rskic+Kc)- 

Combining this bound with inequality (5.12), we conclude that all paths 7 with F( 7 ) > 0 
have at most R:ik(Ak?+\+n Ak . 2+l ) points. Since balls of radius i in the graph A(L, d) have at 
most (2d) e + 1 vertices in them, balls of radius £ in have at most (2dk) e + k vertices 

in them. Thus, at most (2 dk) 3 fc( 4 fc 2 + 1 +^ 4 fc 2 +1 ) p a ti ls j n G“ e_se with positive support can 
pass through any given edge. Most importantly, all of these bounds depend only on k and 
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d] they are uniform in n sufficiently large. Combining this observation with inequality (5.8) 
and applying Theorem 4, this implies 


(/,/) < Ak,d£ Q MH-SE(f,f) 


(5.14) 


for some constant Ak,d that depends only on k and d. Thus, applying inequality (5.7) and 
Lemma 5.6 to inequality (5.5), we conclude that there exists some constant C' kd so that 


MH-SE 

a n,k 


— C'k,d a 


SE 

n,k' 


Combining this bound with inequalities (5.9) and (5.11) completes the proof of inequality 
(5.6). □ 


Finally, we prove Lemma 5.1: 


Proof of Lemma 5.1. Translating the main result of [Yau97] into our discrete-time setting, 
we have 


a 


SE 

n,k 


> C d n 1 d 




for some constant Cd that depends only on d. By Lemma 5.7, this implies 

Oi n ,k > C7i~ 2 ~% 


for some constant C = C(c,d,k) that doesn’t depend on n. Applying inequality 3.3 of 
[DSC96] yields the conclusion. □ 


6. Drift Condition for V t 

Recall the process V t = Yh V £A(L d) from formula (2.3). The graph G t is the subgraph 
of G = A (L,d) induced by the vertices [v e A (L,d) ■ X t [v] = 1}, with vertices V(G t ) and 
edges E(G t ). Let F t denote the u-algebra generated by the random variables {A s } s < t . The 
key result in this section is the following drift condition on {K} s > t : 

Theorem 6.1. There exists some constant 0 < eo = eo(c, d) independent of n so that for all 
0 < e < e 0 , there exist constants Cq = C(e,c,d) < oo, a = a(e,c,d) and N = N(e,c,d ) so 
that, for all k G N and all n > N, 

E[V t+ken s\E t ] < (1 - a) k V t + C G . (6.1) 

Besides a small number of dehnitions that are explicitly recalled, Theorem 6.1 is the only 
part of Section 6 used in the remaining sections. As the proof of Theorem 6.1 is somewhat 
long, we give an outline: 

(1) We show that the number of particles V t is generally close to the number of connected 
components in G t , and so it is enough to bound the latter quantity (see Lemma 6.2). 

(2) By embedding a coalescence process into the KCIP, we show that the number of 
‘collisions’ between components of the KCIP over the time interval {t, i+1, ..., t+en 3 } 
is on the order of the number of connected components in G t (see Lemma 6.15). 

(3) By direct computation, we show that collisions involving components of size 1 will 
decrease a certain biased count of the number of connected components G t (see 
Definition 6.3 and Lemma 6.6) while other collisions will not increase this observable 
by much (see Lemma 6.8). 
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(4) By an argument based on bounding the influence of faraway points, we show that 
a substantial fraction of collisions occur between components of size 1 (see Lemma 
6.9). 

(5) Steps (2)-(4) above will yield that a positive fraction of connected components of G t 
will be involved in a ‘good’ collision over a reasonable time scale, and this leads to 
a contraction estimate on a biased count of the number of components (see Lemma 
6.11). By the observation made in (1) above, this leads to our final contraction 
estimate on V t . 

Recall from Section 4 that for a vertex u, Comport) is the collection of vertices contained 
in the same connected component as u in G t ■ Recall also that Y t denotes the number of 
connected components in G t ■ Define the number of excess particles 

5 S = V S - Y s . (6.2) 

The next lemma compares the number of particles to the number of components. 


Lemma 6.2. For n > 2c and for all s,t > 0 


E[<5 t+S |7i] < (l-—) a 5 t + 4cd. 


Proof. We first show that 


E[<W, U p, s , i<+ . v,<*|.F«] < (l - 1(1 - fjfs, + ^ 1 


n 1 - ■£ 


(6.3) 


(6.4) 


Indeed, for v G A(L, d), let Nf d ^(v) be the number of components of G s that are adjacent to 
v. Define 

= {v G A (L,d) : X s [v] = 1, |Comp s (w)| > 1}, 

B s = {v G A (L,d) : X a [v] = 0, Nf\v) > 0}, 

D s = A(L,d)\(A s UB s ). 

Then we have 

E[5 s+ i|j;] = E[(5 s+ i|j;,n s G A,]P[u s G A,|.F S ] (6.5) 

+ E[5 s+ i| J- s ,v s G R s ]P[u s G B S \J- 5 ] + E[<5 s _|_i| J- s ,v s G D s ]P[v s G D S \J -’ s ] 

— (^s — (1 - ))P[Ds G A s IJ 7 5 ] + (S s H —)P[n s G B S \J - ,] + <5 s P[u s G D s \J r s ] 


n 


n 


— h s P[n s G A s U B s U D S \J- g] — (1 — —)P[n s G A S \J - s ] + — P[n s £ B S \J -’ s ] 

, , S af c v 2 cdV s 

< b s -(1-J H- — ■ 

n n n z 

This can be iterated to give: 

E[(5t+ s )l SU p t < i < t+s Vi<k\Xt] = E[E[(h t+s )l suPt ^.^ t+s Vi <k\X t+ s-i\ \J~t\ 


< E 

< E 

< .. 


(1 - — (1 — -))<<+,-! + 2ciVi *‘~ 1 
n n n z 

, 1 . _ 2 cdk 

(!-- <Wi + — 

n n n 


^upt<i< l+ Ai< fc r 1 


Lsu Pt<i<t+« Vi<k\J~t 
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FIGURE 3. Evolution of the component sizes of H t . 


< ( 1 -!(!-£))■*+* 2 * -L 

v n v n 7 n 1 — 


£ ’ 
n 


which is inequality (6.4). Since V s < n, this implies inequality (6.3) for n > 2c and the proof 
is finished. □ 


6 . 1 . Corrected number of components. Our next goal is to obtain a bound on E[Y)+ S | J-)] 
in terms of Y t over certain time intervals. To this end, we digress briefly and introduce a 
new object called the “corrected number of components” of a graph. 

Definition 6.3. For a graph II. we define a Markov chain {//,},>o with absorbing states as 
follows. Set H 0 = H . For i > 0, if all components of Hi are size 1, set H i+ 1 = H Otherwise, 
select uniformly at random a vertex Uj in H\ that also has at least one neighbour in H, and 
set H i+ 1 = We then define Mh = E^im^oo \Hi\], Since H i+ 1 = H t for all i > \H\, 

this (random) limit always exists. See Figure 3 for an illustration of the evolution of H t for 
an initial graph H 0 with five vertices. 


Definition 6.4. For feN, set 

M k = sup{A/}r : H C A(L, d ), \H\ < k}. 

We make some initial observations. By simple case checking, we can show that M\ = Afo = 
1, A 3 = |,A /"4 = |. We clearly also have 

M k <k, k> 2 . ( 6 . 6 ) 

Next, we show that ACh can never be too small: 


Lemma 6.5. For any subgraph H C A (L,d), we have 


M h > 


\H\ 

2d + l 


Proof. Let 0 be the Markov process given in Definition 6.3, and let Hoo be its limit. 

We have 


Mh — E [ ^2 l^eRoo] — e #00] ■ 

v£H v^H 


(6.7) 


However, for any particular vertex v G H, v G as long as it is the last vertex among its 
neighbours to be selected as an update vertex v t . Since v has at most 2d neighbours in H , 


P[t> £ Hoo] > 


1 

2d + l 
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Figure 4. Star graph 


Combining this with formula (6.7), 


C'H > 


\H\ 


2d + 1’ 

completing the proof. □ 

For 1 < k < 2d , let i7(fc) be the star graph with k leaves (see Figure 4 for a star graph 
with d — 2, k = 4). 

For the purposes of the following lemma, we denote the vertex set of this graph by 
V ( H(k )) = {1, 2,... , k + 1} and the edge set by E(V (. k )) = {(i,k + 1) : 1 < i < k}. 
We give basic bounds on how A/~H(k) depends on small changes to the subgraph H\ 

Lemma 6.6 (Shrinking Star Graphs). For ATh as in Definition 6.3, 


\r k 
A!H(k) ~ ~ 


2 ' ITT < 6 ' 8 > 

Proof. It is easy to check that Mh{ i) = 1 and Nh{ 2 ) = A 3 = |. Let {Hi} ieN be the Markov 
chain described in Definition (6.3), and let {ryjjgN be the associated sequence of vertices. 
The quantity A r H{k) satisfies the recurrence: 

AH(fc) = k¥[v 0 = k + 1] + F[v 0 7 L -k + l]fif H (k-i) (6.9) 

k , k r 

-J\lH(k -!)• 


Iterating, for all 0 < q < k 


A ~H(k) — 


1 


k + 1 k + 1 
3 we have: 

-(k + (k — 1 ) + ... + (k — q)) + 


k 


k + 1 


Aif(fc_ 9 _i). 


k + 1 

Setting q = k — 3 gives formula ( 6 . 8 ). □ 

More generally, 

Lemma 6.7. For any graph G and all subgraphs FI C G and all vertices v G G (adjacent to 
H or not), 

A< Afmj{v} < Ah + 1. 

Proof. We begin by showing A/hu{„| < Mh + 1 by induction on m = |i/|. For \H\ G {1,2} 
this is clear by direct computation. Assume that J\fnu{v} < Ah + 1 holds for all \H\ < m. 
Fix H with \H\ = m + 1, define N(H) to be the vertices of H with at least one neighbour 
in H and, by the same argument as in recurrence (6.9), 

■Vhum = iunr, r.,111 E, 


WifuWJI 


u€N(HU{v}) 


< 


|JV(ffu{«})| 


jV„i i)GJV(I/Up}) + 


|JV(H u{»})| 


Yi (Vh\{«) + 1 ) 


u£N (HU{v})\{v} 
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< Mh + 1, 

where the induction hypothesis is used in the second line. 

We also prove Mh < Mhu{v} by induction on m = \H\. Again, the case that \H\ G {1,2} 
is clear. Assume that Mh < Mhu{v} holds for all \H\ < m. Fix H with \H\ — m + 1. We 
consider two cases: v is adjacent to H or v is not adjacent to H . If v is not adjacent to H, 
we clearly have Mhu{v} = Mh + 1. If v is adjacent to H, then N(H U {u}) = N(H ) U {u}. 
In this case, 

= |jv(//)| + A" + |iv(//)| +1 

-1^)1 + M + |]vmi + i u M 

= Mh, 

where the induction hypothesis is used in the second line. This completes the proof. □ 
6.2. A corrected version of of Y t . In order to bound Y s — Y t , we introduce the process 

Y a = Mh - ( 6 - 10 ) 

HeComp s 

We think of {W} s >t as a ‘corrected’ version of {Yg}s>t- hi general, {Y s } s > t is easier to work 
with than {W}s>t- One reason is the martingale-like property 

E[(Ft+i — Y t ) l pt >£. IF t] = 0, 

which does not hold for {Y s } s >t (here p t is as defined in ( 1 . 1 )). In addition, {W} s >t is much 
better-behaved than {W}.s>t over short time intervals, especially when the KCIP is far from 
equilibrium. Using ( 6 . 6 ), it can be verified that 

W < Us < n + 5 S . (6.11) 

Since 5 S is often small (see Lemma 6.2), Y s is a good proxy for the quantity Y s . 

Next, we show that the corrected number of components {Y s } s > t does not grow too quickly 
over a moderate time period: 

Lemma 6 . 8 . Fix 0 < e < 96 c 2 ^ +1 p ■ Then for all n > N(c,d ) sufficiently large, and 
nlog(n) <s< 9 ^ 3 , 

E[Y t+s \M]<Yt(l + 0(e)) + 0(l). 

Proof. We will show that for 0 < s < en 3 , 

m+.\?A <U 1 + 96 c 2 (d + 1 ) 3 ( 8 n(l - Yy + s )) + —6, + (1 - T)X ( 6 . 12 ) 

v n d 8 n n 8n' 

Since S t < n, Lemma 6.8 immediately follows from ( 6 . 12 ). 

Assume without loss of generality that t = 0 and define 

/w = ^yE[jjj 0 ], 

n ^' 

u =0 


21 










By Lemma 6.7, E[y s+ i|.77,] < Y s + 1 for any time s. Inequality (6.6) provides further nec¬ 
essary (but not sufficient) conditions for Y s+ \ > Y s to hold when the update variable p s in 
representation (1.1) satisfies p s < the vertex v s in representation (1.1) must be adjacent 
to a component of size at least two in G S1 and furthermore we must have X s [u s ] = 0. At time 
s, there are at most 3 dS s vertices satisfying this necessary condition, and so for 0 < s < en 3 , 


E\Y 8 \F 0 ]=E\E\Y 8 \F 8 _ 1 ]\F 0 ] 


(6.13) 


= E 


eJXij;.!, {p s _i < -}]p[p s _i < -IJ's-i] 


n - 


+ E[y s |J r s _i, {p s _ 1 > — }] P [p«—1 > -l^a-l] I^C 


< E[(y a _! + [Ps-i < -IFs-i] + Y s -iW[p s -i > -IFs-i] l^o] 


n 
3 cd. 


n 


n 


<E[F s _ 1 |J'o] + — E^.xIJ-q] 


r 


< 


s-l 


Q rf ] 


u=0 


Lemma 6.5 implies 


K 


Y a > ——. (6.14) 

2d +1 v ; 

Using this fact, and then inequality (6.5) followed by inequality (6.13), we find that for all 
n sufficiently large, 




(6.15) 


u =0 


3 cd 


n* 


(ho + ^EfE^lJ-^lJ-o]) 


U= 1 


3 cd / r v— r / 1 \ r 2cdV u - \ , _ 

s -r(-*o + £e[(i - + -^Wl) 

U =1 

s —1 


< “u+(1 - A) Ew + 


U =0 

s —1 


n* 


u =0 


s—2 


3cd , , 1 N 4-A 4c(d + l) 2 12c 2 (d +1) 3 r n 

- + C 1 “ 4 ^) ^-( s “ 1 ) y ° +- ~4 -J^wE^l^o]) 

u =0 


u =0 

3 cd. 12c 2 (d + l) 3 

< —5o +-^- -is 


n- 


s —1 


n 2 


rr 




8 n J 


u=0 


3 cd x , 12^+l) 3 , lW> , ^ 1 w/ ^ 

= — S 0 +- - 4 -(s - 1)1 0 + (1 - )f(s - 1). 

n z nr 8 n 
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Figure 5. Neighborhood of a Collision 

Inequality (6.13) is used in the fifth step of this bound and we use the hypothesis that 
e < 96e 2(^ +1 )3 in the sixth step. Using this calculation with the last line of inequality (6.13), 
we have 


E[Y s \F 0 ]<Y 0 + f(s-l) 

12c 2 (d+ l) 3 


(6.16) 


< Y 0 ( 1 + 


n 


(» - !)) + + f 1 - ^)/(« - 2 ) 


< 


s-q 


<r»(i + E 

< . . . 


x=0 


12c 2 (d + 1) ; 
n 4 




3cd 


s-q 


1 \ s—n 


■X 


<M i + E 


12c 2 (d + If 


8 n 


1 x,- 


n 


3 cd 


x=0 


8 n J 


8 n' 


-x 


x=0 


rr 




< v;,(i - 


96 c 2 (d + l) 4 


8 n 

1 \ s 


x=0 




n° on n 

This completes the proof of inequality (6.12); by the comment immediately following in¬ 
equality (6.12), this completes the proof of the Lemma. □ 


6.3. Typical component size involved in collisions. In this subsection, we study the 
behaviour of a ‘typical’ collision in the KCIP by looking at the behaviour of the KCIP only 
for vertices and times close to the collision. This approach is illustrated in Figure 5. 

Fix t > 0. Define the number of collisions of the KCIP to be 

C s = \{t < u < s : Y u+ 1 < Y u }|, 
and define the set of collision times 

T co \(s) = {t<u<t + s : Y u+ 1 < Y u }. 
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(6.17) 





































































Recall the definition of the update variable v s used in formula (1.1). For u G T co i(s), we 
define 

M co fu) = sup{| Compete) | : (v u ,w) G E} 

to be the largest component involved in a collision; for u ^ T co \(s) we set .M co i(n) — 0. We 
first show that, given u G 7^ 0 i(s), M. C o\( u ) is often equal to 1: 

Lemma 6.9 (Typical Component Size). There exists 5 = 5(c, d) > 0 independent of n so 
that for any e > 0 

E[|{f + nlog(n) 4 < u < t + en 3 : M. C oi{u) — 1}| | J-)] (6.18) 

> (5 — o(n -1 )) E[|{i + nlog(n) 4 < u < t + en 3 : M co \{u) ^ 0}| | E t ] 
uniformly in the initial configuration X t . 

Remark 6.10. We briefly discuss why the conclusion obtained in Lemma 6.9 is plausible. 
Fix a time t and consider any vertex v and any neighborhood B(v) whose size does not grow 
with n. It is easy to check that over any time interval of length T, with n T n 2 , it 
is very likely that all vertices in B(v) have been updated and also that no particles have 
been added to B(v). On that very likely event, all components of G t+ T 0 B(v) are of size 
1. In other words, after a burn-in time of length 0(n), most vertices are not close to any 
components of size greater than 1. The proof largely consists of checking that conditioning 
on a collision occurring at vertex v at times in the interval (t + T, t + en 3 ) does not affect 
this conclusion too much. 

Proof of Lemma 6.9. Set T = nlog(n) 4 . Let {psjsgpn be as given in formula (1.1) to 

define the dynamics of the KCIP. For fixed v G A(L, d) and t + T < r < t + en 3 , define the 
event 

= GT co i(en 3 ),u r = u}. 

Thus 4(r 11 ' 1 denotes the event that a collision occurs at time r and vertex v. We will show 
that, for any v G A(L, d), uniformly in t + T < r <t + en 3 , 

P[ATcoi(r) = 1|> (5 - oin- 1 )) (6.19) 

for some 5 > 0. Inequality (6.19) will immediately yield (6.18). To see this, (6.19) implies 
that 

E[|{f + T < u <t + en 3 : M. co \{u) = 1} | | E t ] 

t+en 3 

= E E p [Vf ra iM = 1 in.4“" ) W4“ 11, in] 

veA(L,d) r=t+T 

t+en 3 

>(b-o(n~ 1 )) Yi E P [4"" , U r «] 

v€\(L,d) r=t+T 

= (5 - o(n^ 1 ))E[|{t + T <u<t + en 3 : M co \{u) ^ 0}| | Et ], 

yielding inequality (6.18). Thus it suffices to show (6.19). 

We begin with a simple bound on the probability that the number of particles in a small 
region ever goes up by more than a small number over any small time interval. For n G N 
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and 0 < q < 1, denote by Binomial(n, q) a binomial random variable with n trials and 
success probability q. For any fixed t < r < t + en 3 , v G A (L, d ) and £, k G N, we have for 

n > ■>/c(2d)^ +1 , 

r+T c(2d) e+1 

P t E > k] < P[Binomial(T, 

s=r -\-1 


rr 


< P[Binomial(T, -——— 


n z 

< / c(2d) m log(n) 4 x fc 
— 1 n ' ’ 

where the first inequality uses the simple bound |i£^(?;)| < (2dp +1 . 
For v G A(L, d), £ G N, and i + T < r < f + en 3 , define the event 

4r/ se) ={E < si 


)>*] 

)>i] fc 


( 6 . 20 ) 


s=r—T 
i (sparse) 


and let yf( s P arse ) = U t+T < r < f+en 3 iWe A(L i d)Al^“ 0 S o . By inequality (6.20) and a union bound, 

P[^(sparse)|jr t ] > i _ 0 ( n 4 i^EE!) = i _ 0 i(n~ 3 ). (6.21) 

From (6.21), we thus have 

p[vtc„iM= iijp-,, 4" 11 ’] > p[Xocw=li^.^russiPkthoo’V.] 

> P[-M„,(r) = - 0(1))- (6-22) 

In light of (6.22), to show (6.19), it is enough to show 

P[X col (r) = l|JF t , 4 ^] > (i - o(n-‘)). (6.23) 

To this end, we proceed by defining two events of interest. For v G A(L, d), t+T < r < t+en 3 , 
and some constant Bi to be determined later, define the event 

r 

AE 0 = { E 1 P«<f ivueBwoiv) = 0} 

u=r—B\n 

that no vertices are added near v in the (short) time interval of length Bin immediately 
before the collision. Finally, for fcGN and t + T < r < t + en 3 , define the event 


^(regular) _ 


{ ^ ] 1'i'ueBioofi) — 
u=r—T 

that at most k vertices are updated in a ball around v in the interval of length T before the 
collision. 

Fix the constant C\ = 25(2d) 101 . For any t + T < r <t + en 3 , we have 
p[>Mr) = 

>PlAd , (VI = 1 I 75 d( co11 ) /l( s P arse ) .4 (gap) ] y ip [ /I (g a P) I ~p 4 (coll) A (sparse) j 

— IT l co l \> J 1*' 1 5 'S-\-v,r ? 100 J A ^ r^v,r ^ ^v,r,100 J 

> Pl Ad , (VI = 1175 d( co11 ) 4( s P arse ) 4(gap)] V pf 4 (gap) I TT A (coll) a (sparse) . (regular) i 

— V [J Icolv / 1 4 ti i •s~'-v,r, 100 J ^v,r J A ^ rhi,r I*' ti • / ^v,r i‘^■v,r, 100 ’‘ Al ti,r,Ci log(n) 4 J 
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x F[A 


(regular) 
v,r,C i log(n) 4 


jr A (coll) A (sparse) i 

^v,r ? ^v,r, 100 J 


= Ti X T 2 X T 3 . 


(6.24) 


The remainder of the proof consists of obtaining a lower bound for each of the three factors 
T 3 , T 2 and T 3 in inequality (6.24). We begin by bounding T 3 . For a subgraph H C A(L, d) 
and a configuration X G {0, 1 } A( T 4 ), denote by X\h the restriction of X to H ; that is, 
X\h G {0, 1} H and satisfies A|#[u;] = X[w] for all w G H . Let 

= {^h{0), Vtff(l), ■ ■ ■} = {s >t : X s \ H 7 ^ AA+i|iy} 


be the ordered sequence of times at which the restriction of to H changes. Finally, 

let Gv, r be the cr-algebra generated by the random variables {-X’ s |B 100 („)} ae ^ jB w n{ r -T,...,r-i} 

and 'Gbioo(v) ■ For notational convenience, let S t be shorthand for Jy, ^4.^,Too^ an4 

denote by ipBwoM the complement of the set 'ipB 100 (v)- We have 


< 


< 


Ci log(n) 4 
1 

Ci log(n) 4 


E 


E 


1 - I 3 = 1 - E[P[^”g"*(»)<l‘S,.e„]|5 l ] 

i r r -‘ 


E[ E lt> s eBioo(^) I St, Gv,r\ 1*5* 

s=r—T 

(2d) 101 + 16 + E[ 


E 


se{r-T<u<r-l:’u u eBioo(^)}nV>g looW 


(6.25) 

lu s eBioo(^) \^tj Gv,r\ |«St • 

(6.26) 


Inequality (6.25) is simply an application of Markov’s inequality. To obtain (6.26), we split 
{s : v s G £>ioo(v)}n{r — T,r — T + 1,... ,r —1} into two sets: {r —T,... ,r —l}fl'?/'B 100 (^) and 
everything else. Inequality (6.26) then follows from noting that the number \{r — T,... ,r — 
1} Fl 'ipBioo(v )I °f times that a particle is added to or removed from £>i 00 (i’) between times 
r — T and r — 1 is, at most, the number of particles in that set at time r — T plus twice the 
number that have been added between times r — T and r — 1. Thus, conditional on M sparse , 
we have \{r - T,... , r - 1} Fl ^b 100 {v) | < (2d) 101 + 16. 


Fix s G {r — T, ..., r — 1} fl V’Biooh)- We claim that if w i, w 2 G A(L, d)\Bios{v), then 


F[v s = Wi\St,g v>r \ 

P[u s = W 2 \S t , Gv,r] 


(6.27) 


since updates to these two vertices cannot influence anything in £>ioo(u) before time r. More 
formally, for any update sequence {(u M ,p u )}j)~),_ T w hh v u = w±, define the update sequence 
{(?(, Pu)}uZ^r-T hy v' u = v u for m ^ s and v' s = w 2 . This map is a bijection between the 
update sequences allowed by the conditions in the numerator of Equation (6.27) and the 
update sequences allowed by the conditions in the denominator of Equation (6.27); the 
existence of this bijection proves Equation (6.27). We similarly observe that if Wi G B wo (v) 
and w 2 G G\Bios(v), then 


V[V S = Wi\S t ,Gv,r\ < 1 
¥[v s = W 2 \S t ,Gv,r\ ~ 


(6.28) 
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since certain updates within Bi 00 (v) may be forbidden by the conditioning Q v . r . This can be 
made formal in essentially the same way as the argument for Equation (6.27). Inequalities 
(6.27) and (6.28) imply 


bs £ B 100 (v)\S t ,g v>r ] < 

< 


Bioo(v) 


n - | -Bios (v) | 
(2d) 101 


n 


+ o(n ), 


where the second line follows from noting that |£^(u)| < (2d) £+1 . Combining this with 
inequality (6.26), 

(2d) 101 + (2d) 101 log(n) 4 


t 3 = > l 


v,r,C i log(n) 


Cl log(n) 4 


+ o(n ). 


Since C\ = 25(2d) 101 , for n sufficiently large, 
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T 3 > —. 
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(6.29) 


Next we bound T 2 . For H C A(L, d), let 4>h = {<Aff(0), 0#(1), • • •} = {s > t : v s G H} 
be the ordered list of times at which the update vertex v s falls in H. Then let 'H v , r be the 
cr-algebra generated by the random variables {v s : s G 0b 1oo („) fl {r — T,..., r — 1}}. Unlike 
Q Vt r, the update times <Ab 100 („) are not included in this cr-algebra, only the update locations 

v s - 

For notational convenience, let S' t = {J-), }• The gap condition 

A[ S r P ^ is fulfilled if there are no updates to the region 0 (u) in the time interval {r — 
Bin,..., r — 1}, and so 


T 2 = P^l^] 

> EpP[^ Bl00 ( W)r ) fl {r - Bin,... ,r - 1} = 0|«S t ',74,rM- 


(6.30) 


The indices </>b 100 („) fl {r — T,..., r — 1} are, conditional on 'H v<r , a uniformly-generated size- 
|0Sioo(t>)C {r T, ... ,r-l}| subset of {r-T, ... ,r-l}. Since |0 BlooW n{r-T, — , 7 —1}| < 
Ci log(n) 4 by the condition 5 we ^ ave f° r n sufficiently large 


p [V’Bioo(«,r) n {r - Bin, • • •, r - 1} = 0|S t ', H v>r \ 


E 


0<k<Ci log(n) 4 


Cf") 

(D 


B 100 U) n {r - T,... ,r - 1}| = k\S' t ,H v ,r] 


Bin —1 


E II - f - i A lfe.o.w n {r — T, ... , r — 1}| = k\S'„ 

0<k<Ci log(n) 4 ^=0 
Bin— 1 


^ e no- 

0<fc<Ci log(n) 4 i=0 
Bin— 1 

5 n 0 


2k 

~T 


'Bioo(f) n {r - T, ..., r - 1}| = k\S' t , U v<r ] 


2Ci log(n) 4 ^ ^ „_ 4SlCl 


i =0 


n log(n) 


)> 
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Combining this with inequality (6.30) gives 

T 2 > e ~ 4BlCl 


(6.31) 


Finally, we bound the term Ti. Let 

= {b 10 oW C U {«,}} 

r—B\n<s<i —1 


be the event that every element of Bioo(v) is updated during the time interval {r—B^n ,..., r— 
1}. Roughly speaking, for any fixed configuration X G {0, l} A ( L,d ), we will denote by £^f x up) 

all of the updates ‘allowed’ by and More precisely, up ^ is the set 

of updates {(v s ,p s )} r _B in <s<r-i that have the properties: 

• If Ps < f, then v s 4- Biao(v). 

• If X r _ Bin = X and this KCIP process is updated using the dynamics (1.1) with 
update variables {(u s ,p s )} r _s ira < s < r _i, then W-iM = 0 and also there exist two 
neighbours wi, w 2 of v that are in distinct components of G r _i and satisfy X^^w^ = 
^r-lH = 1- 

Similarly, we will denote by S^ x up,) the set of updates ‘allowed’ by , XA'A'’’ and 

VC1 • More precisely, this is the set of updates {(v s , p s )} r -B in <s<r-i that have the prop¬ 
erties: 


•If Ps< then v s B wo (v). 

• Bioo(u) C U^_ Bin {u s }. 

We claim that if v,r,X are such that |^ c ° 1 j7 up ' 1 D £^ C “W UP ' ) | > 1, then in fact 


y v,r,X 

I c (coll—up) ^ ^(cov—up) I 
\^v,r,X {] ^v,r,X I 


I s. 


(cov—up) I 


> (2d + 1) 


-2d 


(6.32) 


u,r,X 


To see this, denote by {i Ui}i=± the neighbours of v and by {wi.jY]=\ the neighbours of Wi. 
Call an element of £^ c °\L up ^ good , if for all 1 < i < 2d, 


inf(s : s > r — Bon,v s = wA > inf infjs : s > r — Bin,v s 

l<j<2d 




In other words, a sequence is good if vertex m,; is not updated until after all of the vertices 
{ w i,j}j=i have been updated at least once. A good sequence will not ever remove a particle 
from a neighbour of v, and the configuration X r _i\ Bl00 ^ resulting from a good sequence 
will have only singletons. Thus, any good sequence will also be in up) if £ <--»P) 

is 

not empty. Since at least one out of every (2d + l)~ 2d elements of I s g°°d, this 

observation implies inequality (6.32). 

Inequality (6.32) can be expressed as 


pr 4 (coll) I 77 A (sparse) . (gap) A (cover)] > /p j , i \-(2d+l)pr A (coll) I -p A (sparse) , (gap)] 

\r^v,r l*' ti ' / ^v,r,100 ? *^v,r ’ ^v,r J — \ ^ vr I*' t? *'H;,r,100 ? ^v,r J* 


We thus have 


Tr = P[A4 co i(r) 
> P[A4 co i(r) 


I I 77 A (coll) A (sparse) 4 (gap)l 

-hI*' 1 1 •S~^v,r ’ v ^*'u,r’,100 ’ v/ *'u,r J 

1 I 77 A (coll) a (sparse) , (gap) A (cover)]p[ A (cover 

- 1 -1*' t? *'Vy,r ? * / T;,r,100 ? ^v,r r^v,r 


(6.33) 

) I 77 A (coll) ,1 (sparse) 4 (gap)] 

W ti * rx v,r > 100 ’ J 
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= 1 x FT l( cover )| Ti i( co11 ) 4( s P arse ) a (g a p)i 

± A - L' Al o,r I ' r ti ) v ^ l u,r,100 ) J 

ipr /l( covel ')| JT A 

_ Tpr / j( coll )| 77 .(sparse) .(gap) .(cover)] L v ’ r I *’ 

lrS;,r !•'£? *^S;,r,100 ’ ^v,r J 


(sparse) .(gap), 
ti,r,100 ) ^v,r J 


> 


p[ A ( co11 ) I T (sparse) .(gap), 

K ti ' / R,r,100 ) S*v,r J 

PT A ( cover ) I TT A (sparse) .(gap)-, 
(Ofj I 1 \-2dpr .(coll) I T . (sparse) .(gap)] jfV^ K ^.r.lOO J 

(za+ij r LA;,r Kt> A>,r,100 > A;,r J pr .(coll), p- .(sparse) .(gap), 

l * r ti 100 J 

( 2 d+i)- 2 d P[^r r) i^, ^,ioo e) , 4 r } i- 


By a monotonicity argument essentially identical to that used to prove Equation (6.27) 
followed by the standard ‘coupon-collector’ bound, there exists some B > 0 so that 


ipr a (cover) i -77 .(sparse) .(gap)] ^ 

^ \r^v,r I*' t) *'*"u,r,100 ? J — 


(6.34) 


for all B± > B and all n > N(Bf) sufficiently large. Choosing B\ — B + 1, and combining 
equality (6.24) with inequalities (6.29), (6.31), (6.33) and (6.34) gives the bound 

P[X ro ,(r) = > Hl e - 4 W+‘) c >(2d + l)- 2d + ofn" 1 ). 

Combining this with inequality (6.22), we have 


P[-Mcoi(r) = 1|7i,4“ U) ] > «5 + o(n- 1 ) 


with 4 = ||e 4 f B + 1 )Ci ( 2 d 4 - 1 ) 2d , verifying (6.23). By the observations made in (6.19) and 
( 6 . 22 ), the conclusion in (6.18) follows immediately and the proof is finished. □ 


Recall the definition of the number of collisions C s from formula (6.17). We will use Lemma 
6.9 to bound the expected change Y t+en 3 — Y t in terms of C s : 


Lemma 6.11 (Comparison of Number of Components to Number of Collisions). Fix 5 = 
S(c, d) > 0 as given by Lemma 6.9. Then for all 0 < e < e(c, d) sufficiently small, 


2(5 

E[V i+m 3 - Y t \B t } < E[|{t < u < t + en 3 : M co fu ) ± 0}| \B t ] 

+ y, 128c2(4+1)3£ (l + o(l)) + 0 ( 1 ), 


where the implied constants do not depend on e or n. 


Proof. Throughout the proof of this lemma, we use ‘update variables’ {p s } se N, {u s } se N from 
formula (1.1). We have 

E \Y t+en 3 - Y t \B t ] = E[ ^ (Y u+ 1 - Y u )\B t ] (6.35) 

t<u<t-\-en 3 :Mcoi{u)=l 

+ E[ J 2 {Yu+i-Y u )\B t ] 

t<u<t-\-en 3 : A / l C o\( u )^/ z ^ 


= Ti + T 2 . 
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We first estimate the term T 2 . From Definition 6.3, we have that E[(Y u+ i — Y u )\p u > ~, = 

0, and so 

T 2 = E[ (^*+1 - Y u )l Mcol (u)^i1-pu<±\Ft]- (6.36) 

t<u<t+en 3 C ol( w )^l 

By Lemma 6.7, the corrected component count Y u cannot change by more than one when 
vertices are added and no collisions occur: 

|(W+i ~■ Y u ) 1 m co i(u)^i ^-Pu<% 1 — 1- 

Finally, if \H\ < 2, then Mh — 1- In particular, adding a vertex to X u can only increase Y u 
if it connects two components or if it is added to a component that already has at least two 
vertices. Since there are at most 4 d5 u vertices adjacent to such a component, 

~ ~ 4 cd 

E[(Y u+ i - Y u )l MciA {u)?ilpu<$\Ft] < — 


7l~ 


Combining this bound with inequality (6.36), 


4 . -| 
Ts — n 2 X/ 


(6.37) 


U=t 


By inequality (6.15) and the calculation in inequality (6.16), we have 

t.+en 3 inn J ^ , -| \3 


4cd r x i 771 ^ 1> 128c (d+1) ( ^ 1 yd ^ 32 cd 4^ j- 

—E[ ^ 5 u \T t \ < Y t - — -(8n(l - —) + s) + —8 t + ^(1 - 


U=t 


n° 


8 n' 


n 


8 n J 


Combining this with inequality (6.37), this implies 


T 2 < Y t 


<Y t - 


128c 2 (d+ 1) ; 

3^ 

128c 2 (d+ l) 3 e 


32cd, 


( 8 "(1 “ 8 ^)'” + £ " 3 ) + n 5t + (1 “ Sj )™ S> 

(l + o(l))+ 0(1). 


(6.38) 


Next, we turn to bounding T|. By Lemma 6.6, it follows that 
Ti=E[ ^ {Y u+1 -Y u )\F t ] 


t<u<t-\-en^ \ Ai co \(u)=l 


< ——E [I <u<t + en 3 : M col (u ) = l}| | Jj] + 0. 

By Lemma 6.9, this implies 

2 $ 

Ti < —— E[|{t + nlog(n) 4 < u < t + en 3 : M co \{u) ^ 0}| |J)] + o(n~ l ) 
o 

Thus 

E[|7^oi(en 3 ) D {t, t + 1,..., t + nlog(n) 4 }| | Ft] 

< E[|{t < u < t + nlog(n) 4 : V u+1 > V u }\ \T t ] 

t-\-n log(n) 4 

T ' 1 Si 


(6.39) 


2 cd r v—-y . i 

- X^ ^ 

u=t 
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4 cd(d + 1) ,+ "«"> 4 

^ —2 E 1 E 

u=t 

< 4cd(d + 1) ,+ ’g” )4 




u=t 


-O/i , 96c 2 (c£ +1) 3 ^ ^ 1 ^ 24cc? ^ 1 

M 1 + -Ts-( 8 n(i — —) +«))+— s t + (l~—) 5t 


n J 


n 


8 n J 


<y l(1+o(1)) M!!)! + 0( 1) 


n 


(6.40) 


where the third inequality is due to inequality (6.14) and the fourth inequality is due to 
Lemma 6 . 8 . Combining inequalities (6.39) and (6.40), we have 

2^ 

Ti < —— E [| {t <u <t + en 3 : M co \(u ) ^ 0} | | J)] 

+ ^( 1 + 0 (i),Mhl + 0( i), 

3 n 

Combining this inequality with inequalities (6.38) and (6.35), we have 

*26 

E[F t+en 3 - Y t \X t ] < —— E[|{t <u<t + en 3 : M co i(u ) ^ 0}| \X t ] 

+ ^.(i + o(i))!^ht + y mc^d+ay + + 0(1) 

28 

= —— E[|{£ < w < t + en 3 : .M co i(m) 7 ^ 0}| | Xt] 

+f W |1+ „ (1 ) )+0(1 ), 


and the proof is hnished. 


□ 


6.4. Color constrained Ising process. Our next goal is to prove a lower bound on the 
expected number of collisions. To this end, we define a ‘colored’ version of the KCIP on 
a general finite graph G, which allows us to make rigorous the notion of a single particle 
moving and branching over time. 

Definition 6.12 (Color constrained Ising process). Fix £ G N, 1 < k < n and x G {0,1} G so 
that the subgraph of G induced by the vertices {u G G : x[v\ = 1} has exactly k connected 
components. We define a Markov chain {A" s } s > t on the state space {0,1,..., k} n that is 
closely coupled to the KCIP {X s } s > t started at X t = x ; in fact, we will have X s [u] = 1 y rqyo 

for all v G G and s > t. We begin by setting the initial condition X t . For a fixed v G 
G, if X t [v\ = 0, we also set X t [v] = 0. Fix an ordering of the k connected components 
q[1],... , c t [k] of G t , and set X t [v\ = i for all v G c t [i\ and X t [v\ = 0 for all v ^ U ^ =1 c t [i\. 
Note that this arbitrary ordering and labelling of the components is done once, at time £. 
We do not reorder components at times s > t, and we will always have X s [v] in the set of 
labels {0, 1 , ,k}, even if the number of components at time s is not equal to k. Indeed, it 
will turn out that with probability one there exists a (random) index i G {1,2 ,...,&} and 
time S' > £ so that X s [u] G {0, i} for all s > S. 

To evolve X s , recall that {Ab,}<,> t evolves by selecting at every time s a vertex to update, 
and sometimes changing the label of that vertex. Whenever the labelling of a vertex v is 
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changed from 1 to 0 in X s , the labelling of v should also be changed to 0 in X s . Whenever the 
labelling of a vertex v is changed from 0 to 1 at time s in X s , choose a vertex u s ~ Unif {w : 
(w,v) G E, A, s [u;] = 1} uniformly at random from the neighbours of v that have a non-zero 
label in X s , set A" s+ i[u] = X s [ii s ], and then set A" s+ i[u>] = X s [u s ] for all w G Comp s (-u s ). All 
other labels of A" s+ i should be the same as that of X s . Since entire components can ‘flip’ 
colours, the process {A s } s > f will sometimes have several labels change at once. 

We now give some definitions related to the colored KCIP. For all s > t define Comply to 
be the connected components of G s containing only vertices u satisfying X s [u] = i. Dehne 
the number of vertices with color i by 

y (0 V 1- 

V s - L X s [u]=i 

u£A(L,d) 

and the associated interference time by 

Cint = inf {s > t : {|K W - K-il > !} u (| I Compel - | Compel | > 1}} . 

We also provide a generalization of the definition of the triple time in formula (4.3) 

CfiU = inf{s > t : V® > 3}. 

This allows us to state the following corollary of Lemma 4.1: 

Corollary 5. Fix i,t G N and e > 0 and assume that I'f'' 1 = 1. Then 

p [4:U - 4 < 4»)] = °(4 

Proof. We reduce this to the case of Lemma 4.1. Let v G A(L, d) be the unique vertex with 
X t [v\ = i. If Z2ueA(Ld)Xt[ u \ > I; we can couple {A" s } s > t to a second KCIP {X l s \s>t^ with 
initial condition X[[u] = l u =v in the following way. Let p s , v s be as in representation (1.1). It 
is possible to obtain a third vertex with label i only by either growing the single component of 
color i to size 3 without having a collision with any other component, or by having a collision 
with another component. Thus, for any constant 0 < C < oo, if Ctripie < nrin (C, £^) holds 
for the process {A" s } s > t , it must also hold for the process {A'} s > t . Thus, for the purposes of 
proving inequality (6.42), it is enough to prove it in the case that X t [u\ = l u=v ; but in that 
case, the conclusion follows by Lemma 4.1. □ 

We next recall the Coalescence process ([CS73, HL75]): 

Definition 6.13 (Coalescence Process). Fix a graph G, k G N and q G [0, 4], A coalescence 
process on graph G with k initial particles and moving rate q is a Markov chain {Z s } sG h on 
G k . Let O s = {v G G : 3 1 < i < k such that Z s [i\ = v} be the occupied sites of Z s . To 
evolve Z s , we first choose u s ~ IJnif([0,1] ), v s ~ Unif([O s ]) and u w ~ Unif([ J Bi(n s )\{n s }]) 
and set I s — {i : Z s [i\ = v s }. If < g|O s |, then set Z s+ i[i] = u w for all i G I s and set 
Z s +\[j] = Z s [j] for all j <£ I s ; otherwise, set Z s+1 [j] = Z s [j ] for all j. 

Remark 6.14. The coalescence process has many other names and descriptions. The con¬ 
struction of the coalescence process as the ‘dual’ to the voter process is well known (see 
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(6.41) 


(6.42) 




Figure 6. Embedded Coalesecence Process 

[HL75]). We can also view the coalescence process with k initial particles as k random walk¬ 
ers that take turns making independent simple random walk steps until a collision occurs, 
at which point the colliding particles ‘merge’ into a single particle. After this collision, the 
coalescence process resumes with k — 1 particles. 

6.5. Coupling KCIP with coalescence process. Next we couple the KCIP with the 
coalescence process to obtain a lower bound on the number of collisions C s (see formula 
(6.17)). 

Lemma 6.15. With notation as above, there exist constants k = n(e,c,d) > 0 and C = 
C(e, c, d) < oo that do not depend on n so that 

Ept+en 3 !^ 7 1] A K Yt — C. 

Proof. Roughly speaking, our strategy is to show that the KCIP has ‘almost as many’ colli¬ 
sions as an associated coalescence process on A(L, d). Recall the ‘color constrained’ KCIP X s 
from Definition 6.12. We define {Q s [f]} s >t+n 2 -5 1 < i < Y t to be a collection of (1 — ^)-lazy 
simple random walks on A (L,d), with starting points X t+n 2.s[Q t+n 2.5[i]] = i. These walks 
will not be independent; we will later define a coupling of these walks to X s . We will use 
the properties of this coupling to show that the following all hold: 

(1) For 1 < i < Y tl let T(i) be the largest integer so that Q s [i\ £ (A : X, s [v] = i} holds 
for all t + n 2 5 < s < T(i). We show, with high probability, that T[i ) is ‘quite large’ 
for ‘many’ values of i. 

(2) The expected number of ‘near-collisions’ between processes Q s [i],Q s [j], i j is at 
least some fixed fraction of the expected number of collisions in a coalescence process 
{R s } S > t+n 2 .s with initial particles at (Qt+n 2 - 5 [1] , ■ • •, Qt+n 2 - 5 [Yt])- See Figure 6, showing 
the particles of the colored KCIP X t as well as ‘crossed-out’ vertices representing R t . 

(3) Theorem 5 of [Cox89] implies that this last number is ‘almost’ as large as Y t . 

(4) The expected number of collisions in the KCIP is at least some fixed fraction of the 
expected number of near-collisions. 

We now give the formal argument. Assume n > c and fix an index 1 < i < Y t . Let 

T s ( tart = inf{s > t : Vwi,w 2 e A (L,d), {X s [^i] = X s [w 2 ] = i} => {(w 1 ,w 2 ) £ E}} 
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be the first time that no two particles colored i are adjacent. If the set {v : X (») [n] = i} is 

'start 

non-empty, choose a vertex v from that set and let Q (*) [?] = v. Although we have not yet 

^start 

described the evolution of {<5 S [*]} S> n) , we define the associated random time 

T tripie = i n fI s > Atart : the component of G s containing Q s [i] has at least three elements.}. 

Define the lifetime l s [n] of a particle at v with X s [n] = i to be 

£ s [v] = sup{T > s : V s < u < T, X u [v\ = i and v u v] 

when that set is non-empty, and £ s [n] = 0 otherwise. Again, without having defined the 
evolution of (Q,.[?]} . h) , we define the decoupling time of colour i to be the minimum of 

the first time that the particle at Q s [i] does not have colour i and the first time that there 
is a size-three component of colour i: 

decoupling = min (triple) inf ( S > Atari : X a [Q a [l\] 7^ l}) ■ (6.43) 

We now describe our coupling of the processes (<5 S [?]} to {X s }. For r s ^ rt < s < decoupling’ 
we evolve Q s [i] by always setting 


Qs+i[i] = argmax„ eComps(QsW) 4[n]. (6.44) 

When s > decoupling) we are n °t interested in <5 s+ i and do not make use of its evolution 
beyond this time. Finally, define the ‘near-collision’ time 

dear = inf(s > T s drt : there exists an element of X s at distance 2 from Q s [i]}. 

First, we claim that decoupling — m hi (duple)dear). This is because, for all of the color-?' 
vertices near Q s [?] to be removed, there would have to be a collision involving color-?' vertices 
near Q s [i\, and thus also a near-collision. Observe that, conditioned on dtirt; the distribution 
of the path 


Sn r •lX min ( T triple’ T ^ r )- 1 
iVshJ/ (i) 


is equal to that of a simple random walk on G with holding probability 1 — 

We now couple a coalescence process {i? s } s >t+ n 2 S with Y t initial particles and moving rate 
to {<3s}s>t+n 2 -s. We begin by setting the initial conditions i? f+n 2.5. If T s d rt < n 2 ' 5 , we 
set R t+n 2.5[i] = Qt+n 2 - 6 [*]- Otherwise, choose R t+n 2 .s[i\ uniformly at random from among all 
vertices not already containing a particle. Let T> denote the pairs of indices (?, s) that satisfy 


(?) ^ 
Atart < n 


2.5 


< s < min(V. 


(b 


triple 


r. 


(b 


(6.45) 


For (?, s) G V, we set R , s [?] = Q s [i\- For (?,s) f V, we choose i? s +i[f] conditional on R s 
and {i? s+ i[j]}(j !S ) e x> and independently of all other random variables being discussed. We 
now argue that such an extension from the indices (?, s) G T> to all indices 1 < ? < Y), 
t + n 2 ' 5 < s < t + en 3 is possible. To see this, it is sufficient to check that at each (?, s) G V, 
we have: 

• Q s [i] evolves as a simple random walk with holding probability 1 — ^ (he., each 
particle moves as a simple random walk). 
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• If (j,s) G D for some j ^ i, at least one of Q s+ i[i\ = Q s [i\ or Q s +i[j] = Q s [j] must 
hold (i.e., two particles can’t move at once). 

• If ( j , s) EV for some j ^ i, Q s \i\ = Q s [j] cannot hold (i.e., no two particles that are 
being conditioned on have collided). 

These three conditions, taken together, characterize the one-step dynamics of a subset of 
particles evolving as a coalescence process. 

This completes the definition of our coupling of { R s }t+n 2 - 5 <s<en 3 to {A" s } t+n 2 . 5 < s < en 3 . It 

has the critical property that, for t + n 2 ' 5 < s < ^decoupling> we have X s [i 2 s [z]] = i. 

Say that color 1 < i < Yj has a near-collision between times t\ and t 2 if there exists a time 
t\ < s < t 2 and a pair of vertices u,v so that X s [u] = i, A s [u] = 1, u,v are not in the same 
component of G s and \u — v\ = 2 . Say that color i has coalesced by time t\ if R s [i] = R s [j] 
for some j ^ i and s < t\. The color i has a near-collision between times t and f + en 3 unless 
at least one of the following four events occurs: 

( 1 ) 2 . 5 , where for 0 < u < ri 2 ' b we define Ai \ to be the event that r^| rt > t + u and 
i has no near-collisions between time t and t + u. 

(2) 4‘>- 


(3) 4° 

(4) 4 ° 


^"start < t + n 2 ' 5 but color i has not coalesced by time t + en 3 . 

(i) (?) 3 

"Triple < T ^ear < en A . 

'ritart ^ t + n 2 " 5 and color i has coalesced, but for all pairs s, j so that t < s < en 3 
and Rf 1 = R ( i\ we also have r^J coupling < r^L- 
We now bound the probabilities of these four events. Noting that color i has a near-collision 
if there are ever two components of color i (since the two components must be at distance 
exactly 2 when they are first separated), we have by essentially the same calculation as 
inequality (6.5): 




(*) 


w+.i < 4+,+i - —(i - r ) + 

■m.s+i n n 


c, . 2dV t %c 


n 


n 


< vi$. 


(l-l(l- c ( 2d+1 ) )). 
n n 


Thus, for n > 2c(2d +1), we iterate and find 

E[44,Ue, 


1 ,n z 


| Tt\ < ne 


Since V t ^ s is at least 1 for all t + s < TneL-, applying Markov’s inequality gives 


< E[y t 


(0 

t+n 2 - 5 


1 40 


\Rt\ < Tie 


(6.46) 


Next, by Theorem 5 of [Cox89], there exists a constant C = C(e, c, d) that does not depend 
on n so that 


E[£ l 4 .»l < C 


(6.47) 


for all n sufficiently large. By Corollary 5, 


E[£ q w ] = 0(e)Y t . 


(6.48) 
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To bound A ^, we consider the collection of colors j that are given in the dehnition of A$. 
We say that these colors are involved in an ‘unrecorded collision,’ as the collisions described 
in event do not contribute to our count of the total number of near-collisions of the 
KCIP. Observe that any particle R s [j ] that is involved in such an unrecorded collision at 
time s necessarily coalesces with another particle R s [j'] during the course of the collision, 
and also must have Tr2 oupling — s • Thus, each decoupled particle can be involved in only one 
unrecorded collision before being merged with another particle, and so 



- 2 E 1 ^ i) n 2.5 + 1 4 i) + 


Denote by A/" s near the total number of near-collisions between times t and s. 
inequalities (6.46), (6.47), (6.48) and (6.49), we have that 


(6.49) 

Combining 


E[ACM > y,( 1 - 0(e)) - 0(1). 


(6.50) 


Finally, we must relate the number of near-collisions to the number of collisions C t+en 3 : 


Proposition 6.16. Fix e > 0. There exists k = n(c,d ) > 0 so that for all n sufficiently 
large, 

E[C t+en s +4n 2. 5 ] > kE[A&]. 


Proof of Proposition 6.16. This amounts to checking that, once two components Ci,C 2 of 
G s have vertices w 4 G C 4 , w 2 G C 2 at distance 1, there is a positive probability that all three 
of the following (purely local) events occur: 

• The particles at Wi survive longer than any of their neighbours. 

• A particle is added to the common neighbour v of w 4 ,w 2 before any other particle is 
added to the set Biiwf) U Bi(w 2 ). 

• Some particle is added to the set B 1 (w 1 ) U Bi(w 2 ) before time s + 4n 2 ' 5 . 

The first two events are purely local, and the last occurs with probability at least 

1 — (1 — 4) n ~ J ~ 1 — e~ Cy ^. Thus all of these events have probability bounded away from 
zero, and the proof is finished. □ 


Combining Proposition 6.16 with inequality (6.50), and noting that n 2 " 5 
the proof. 


o(n 3 ), completes 
□ 


Now, we are finally ready to give the proof of Theorem 6.1, establishing a drift condition 
for V t . 


Proof of Theorem 6.1. Recall the definitions of 5 > 0 from Lemma 6.11 and k > 0 from 
Lemma 6.15. Then 

E[V t+en 3\F t ] = E[V t+en 3 - Y t+en 3\R t \ + E[Y t+en 3 - Y t+en 3 ] + E[Y t+en s - Y^Ft] + Y t 

< 4cd(l + o(l)) + E[Y t+en 3 — Y t+en 3 | t] + E[y)+en 3 — Yt\J~ t] + Tt 

< 8cd(l + o(l)) + E[Y t+en 3 — Yt\Tt] + Y t 

< — -^-E[C t+en 31 R t] + O(eYt) + 0(1) + Y t 

< (1 - ^f)Y, + 0(eY t ) + 0 ( 1 ) 
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< (1-— + 0(e))V, + 0(l), 

where the first inequality comes from Lemma 6.2, the second inequality comes from inequality 
(6.11) and a second application of Lemma 6.2, the third inequality comes from Lemma 6.11, 
the bound in the fourth inequality comes from Lemma 6.15, and the final inequality comes 
from the bound Y s <Y S <V S (see (6.11)). 

Fixing e > 0 sufficiently small, then, there exists some constant C > 0 so that 

S Kj 

nv t+en3 \T t ] < (i -—)v t +c. 

Write a — y- Iterating, we have 

E[ "Yt+kenA < E[E[. . . IE[V^ +fce „3 • • • | J^t+en 3 ] I ^t] 

< E[E[... E[(l - a)v t+ (k-i)en3 + c\t + (k - 2 )en 3 }... 1 1 + en 3 )\T t \ 

< (1 -a) k V t + C G 

for some constant Cg and the proof is finished. □ 

7. Excursion Lengths of KCIP 
Fix eo small enough so that Theorem 6.1 applies and set 

tax = 4— (7.1) 

a 

where Cq, ol are as defined in inequality 6.1 of Theorem 6.1. In this section, the drift condition 
for Vt obtained in Theorem 6.1 will be used to show: 

(1) The distribution of the first hitting time of Ui<fc<fc max flfc is 0(n 3 log(n)), uniformly in 
the starting point A" 0 (see inequality (7.5)). 

(2) T>n 3 log(n) implies that 

X] 1 ^eu 1 < fe < fcmax o fc > n 3 log(n) 

t<T 

with high probability, uniformly in the starting point X 0 (see Corollary 6). 

Items (1) and (2) above thus provide strong bounds for the occupation times of KCIP on 
fife, uniformly in k < k mSiX . We start with the following elementary technical lemma. 

Lemma 7.1. Fix 0 < f3 < 1 and 0 < 7 < 00 . Consider a stochastic process on N 

with associated filtration J t that satisfies the drift condition 

E[J s+ i|77s] < (1 — (3)J S + 7 

for all s G N. Let Z 1 ,Z 2 ,... be an i.i.d. sequence of random variables with geometric 
distribution and mean |. If J 0 < f or a H T £ N we have 

t c 

p Eu.<f <7 <r['Z z <>T\- 

s=0 i=1 
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Proof. Assume Jo < -jf and let r ret = inf{s > 0 : J s < y}. We have 


and so by Markov’s inequality 

P[r ret > s] < P[j»lr r „>, > j] < (1- f )'• (7.2) 

Define i 0 = 0 and t i+ 1 = inf{s > U : J s < By inequality (7.2), 

P[*«+i - U > s \{tj}j<i) < (! - ( 7 - 3 ) 

We have 

T C 

P E h.<if < C] < n*o > T] — P[£> - t,-i) > T}. 

s= 0 i=l 

Inequality (7.3) implies that the distribution of (U+i—ti) is (conditionally on stochas¬ 

tically dominated by a geometric distribution with mean this completes the proof. □ 

Define the set 

1C = {x G {0,l} A(L ’ d) ; ^2 x[v] < k max }. (7.4) 

v€A (L,d) 

We apply Lemma 7.1 to the KCIP on A (L,d), with J s = V s and Jt = T t to obtain the 
following corollary. 

Corollary 6. Fix e sufficiently small so that Theorem 6.1 applies, and let a,Cc be as in 
Theorem 6.1. For fixed C 2 andCi > ^C 2 sufficiently large, alln > N(c,d ) sufficiently large, 
and any starting point A 0 G Cl, 

Cin 3 log(n) 

P[ i-Kte/c < C 2 n 3 log(n)] = 0(n“ 5 ). 

t =0 

Proof. Let r sta rt = inf{t > 0 : X t G 1C} and fix k G N. By Theorem 6.1, 

E[I4 m 3l T8tart>fem 3] < (l - -a) V 0 , 

and so by Markov’s inequality and the trivial bound that V) < n for all t, 

P[Atart > ken 3 } < P[I4 en 3l Tstart>feen 3 > 1] (7.5) 

<n(l-^a) k . 

Fix TgN and let be an i.i.d. sequence of random variables with geometric distri¬ 

bution and mean 7 . By inequality (7.5), the Markov property and Lemma 7.1, 

Cin 3 log(n) Cin 3 log(n) 

P[ ^2 ixteAC > C 2 n 3 log(n)] > P[ ^ l Xt eK > C 2 n 3 log(n)\T stavt < T]F[T stait < T] 
t =0 t =0 
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= P^start < 


T C\n 3 log(n) 

r ]E p [ £ 1 Xt(zK. C 2 TI log(?7.) |T s t ar t ^IPf^start ^l^start A T] 

t =0 t= 0 


1 T T C 2 n 3 log(n) 

> (l - n( 1 - -a) L ^ J ) ^ Zi - C 'l^ 3l °g( n ) - ^P^start = 1 1 T sta rt < T] 

t= 0 i=l 

^ T C 2 n 3 log(n) 

> (1 -ra(l - -a) L ^ J )P[ Zi < Cin 2, log(n) — T]. 

2=1 

Choosing T = [^n 3 log(n)J, we have for C\ sufficiently large (e.g., larger than 200a suffices) 
that 


Cin 3 log(n) 

n Y1 lx ^ > lo s( n )] > (1 - n~ b ) (1 - e-%"" 3 log ^), 

t =0 

where the second part of the above inequality follows from a standard concentration inequal¬ 
ity for geometric random variables. This completes the proof. □ 


7.1. Bounds for the collision times of coalescent process. Recall fc max from Equation 
(7.1) and the set from formula (2.1). In this section we obtain estimates for the collision 
times for a coalescence process (see Definition 6.13) started in fl*.. All of our bounds are 
based on soft arguments and are immediate consequences of results from [Cox89]. These 
bounds will be used for obtaining estimates of occupation times of KCIP in Qk in the next 
section. 

Let be a coalesence process on A(L, d) with fc max particles and moving rate q = 

Define the process W t on {0, 1} A ( L ’4> ; 

W t [v\ = 1 3i:Zt[i\=v (7.6) 

W t will often be referred to as ‘the’ coalescence process, as it is a Markov chain and 
can be reconstructed (up to permutation of labels) from {ITtjtgN- We give some notation 
related to the ‘skeletons’ of our processes of interest. Define the sequence of times 0 O = 0 
and 

& +1 = inf{i>& : Wt^Wfr}. (7.7) 

These are the times that {Vh 7 t} teN changes. 

Remark 7.2. Let {</>'}j> 0 be a sequence of i.i.d. geometric random variables with mean 4 
and define {W/}t >0 by 

K = w* 

for t satisfying ^ ■ Th e P rocess (R 7 * }t>o is still a coalescence process in 

the sense of [Cox89]. Furthermore, 

{^}ieN = {^} ieN (7.8) 


E[</>i] 


n 


Ak mxx cd 


E[^]. 
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Define the first collision time as 

r col = inf{f : \W t \ < \W 0 \}. 
For the remainder of this section, define for f > 0: 




- {w £ {0, l} A(L ' d) : inf \u - v\ > C}- 

u,v: w[u\=w[v]=l 


(7.9) 


(7.10) 


We show that, for ( sufficiently large, collision times are not ‘too small’ when started from 

Qf n n k : 

Lemma 7.3. Let be a coalescence process with 1 < k < fc max initial particles on 

graph G = A (L,d) and let {Wfjigpj be defined as in formula (7.6). Then, for all 0 < 5 < 1, 
there exist e = e(c, d, fc max , 5) > 0 and C — C(c, d, fc max , 5) so that, for all n > N(c, d, /c max , 6), 

P[r co i < en 3 \W 0 = w] <1-6, 

P[Tcol < fien\W 0 = w] < 1 - 6 

uniformly in 1 < k < fc max and w 6 Qjfy D Lt k . 


(7.11) 


Proof. We begin by proving the first half of inequality (7.11). By Theorem 5 of [Cox89], for 
any fixed e > 0, any sequence a n satisfying lim^oo a n = oo, and any sequence G GaH, 
we have 

lim P[r co i < en 3 \Wq = w^] = /(e, c, d,k) < 1 

n—>-oo 

for some explicit function / that satisfies lim e _>o f(e,c,d,k) = 0 for d > 3 and all c,k. 
Defining /(e, c, d) = max 2 < fc < fcmax /(e, c, d, k ), this implies 

lim P[t co i < C7i 3 \W 0 = w^] < f(e,c,d ) < 1 (7-12) 

n—>■ oo 

and that lim e ^ 0 /(e, c, d) — 0 for all d > 3 and all c. 

The remainder of the argument is a proof by contradiction. Fix 6 > 0 and choose e > 0 so 
that f(e,c,d ) < 1=^. Assume that inequality (7.11) is false. Then for all C > 0, there exists 
a strictly increasing sequence of integers {n; = nfiC)}^ so that 

sup P[t co1 < e7if\W 0 = w^] >1 — 6. 

Let {C'jjjgN be a sequence satisfying lim J ^. 0O C'j = oo, and for each j let the increasing 
sequence of integers {n 8J - = nfiCj)}^ satisfy 

sup P[t co1 < en 3 j\W 0 = > 1 - 5. 

Then the diagonal sequence n hl satisfies 


liminfj^oo 


sup P[r co i < en 3 j|W 0 = w^ ni ^] >1 — 6. 


(rij j) 

W K 


Since f(e,c,d) < ~y~ by assumption, this contradicts equality (7.12), completing the proof 
of the first half of inequality (7.11). The proof of the second half of inequality (7.11) is 
essentially identical; simply follow the same steps for the sped-up process {lF/}*gN described 
in Remark 7.2 and use formula (7.8) to relate this back to {Wt}tgN. □ 
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Figure 7. Short Path 



Lemma 7.3 has the following strengthening as an immediate corollary: 

Corollary 7. There exist e = e(c, d, /c max ) > 0 and 5 = S(c, d, fc max ) > 0 so that, for all n 
sufficiently large, 

P[t co i < en 3 \W 0 = w] < 1 - 8, 

nr co i<(pen\W Q = w}<l-6 (7.13) 


uniformly in 1 < k < k max and w e D*,. 

Proof. Define the directed graph X to have vertex and directed edge sets 

V(X) = K, (7.14) 

E(X) = {(x,y) : x, ye 1C, ¥{W 1 = y\W 0 = x\ > 0}. 

Our key observation here is that it is possible to change any initial configuration w G fC into 
a configuration w' e Gq by making some number of moves along the edges of X, where 
the number of moves required is uniformly bounded in n (but may depend on C, d or fc max ). 
Since each move takes 0(n 2 ) steps in the coalescence process, and each move has probability 
bounded from below, this allows us to apply Lemma 7.3 to any initial configuration. 

More formally, we begin by proving the first half of inequality (7.13). Fix w e any 
0 < 7 < 1 and let e = e(c, d, fc max , 7), C = C(c, d, fc max , 7) be constants given by Lemma 7.3. 
Overloading notation slightly, if x, y e X, we denote by \x — y\ the length of the shortest 
path from x to y in X\ see Figure 7 for an example of such a path when d — 2 and C — 2. 
Set 

w' = argmin{|w — u\ : 116 flO*,}. 

Let T = {wq = w, w \,..., w\ w - w '\ = w') be a path from w to w' satisfying |Wi — Wj+i| = 1 for 
all i. We have 

nm 0 ,W ^,..., W W| ) = r |Wo = w] > ( 2 dfc max )-l—'I (7.15) 

> ( 2 d/c max )" fc ^ x(c+1) . 

Thus, applying Lemma 7.3, 

P[r col > en 3 |lF 0 = w\ > P[r col > en 3 |lF 0 = w' e ( 2 dk ma , x )~ k ^ c+1) 

> 7(2 dk max )- k ™^ c+1 \ 

This completes the proof of the first half of inequality (7.13), with 5 = 7 ( 2 d/c max ) _fe ™ ax ^ (7+1 \ 
As with Lemma 7.3, the proof of the second half is essentially identical. □ 
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We further strengthen this to bound the probability that particles are close to colliding 
after a small number of steps. Let 

Tnear.j = Llf {t : inf |U ~ v\ < i} 

u,v: Wt[u]=Wt[v ]=1 

be the hrst time that two particles in the coalescence process are within distance i. 

Lemma 7.4. For all i E N and all 0 < 7 < 1, there exist e = e(c, d, fc max , 7 ) > 0 and 
C — C(c, d, k max , 7 ) < 00 so that, for all n > N(c , d, fc max , 7 ) sufficiently large, 

IP['hiear,i < en 3 \W 0 = w] < 1 - 7 , 

P[Tnear,i < <f>en\W 0 = w] < 1 - 7 

uniformly in 1 < k < fc max and vj E Gc' 1 ■ 

Proof. We begin by proving the hrst half of inequality (7.16). Fix vj E Gc\ fix 0 < 7 < 1, 
let <5 = 1 — —y an< i Ci e be the constants associated with 5 as given by Lemma 

7.3. By the definition of r near! i, there is a sequence F = ( w 0 = lF rrieaivi , 1/7 ,..., wf) so that 
\wj + i — Wj | = 1 and so that a collision occurs during the transition from Wi_i to U 7 . Let 
I E N be such that (pi = r near As in inequality (7.15), 

p Win w * I+ n■■■!= t\w h = w 0 ] > (2 dk max y\ 

Since 

P [<Pi+i ~ (pi > n 2 log(n)] = o(l), 

this implies 

P[ T col < Aear.i + T? log(n)] > ——y- tt (1 ~ o(l)) 

^za/c max J 

uniformly in starting position Wq . Thus, for all e > 0 and w E Gq\ 


7 


(4 dk n 


> P[t co i < en 3 \Wo = w] 

> P[r co i < en 3 , r neari i < ^n 3 |W 0 = w] 

> (1 - o(l))(2d/c max ) _ 'P[r nearii < ( -n 3 \Wo = w]. 


We conclude that 

P[Tnear,i < ^n 3 \W 0 = w] < ^(1 + o(l)), 

completing the proof of the first half of inequality (7.16). As with Lemma 7.3, the proof of 
the second half is essentially identical. □ 

We then have the following Corollary to Lemma 7.4. 


Corollary 8. Fix « 6 N. There exist e = e(c, d, k max ,i) > 0 and 5 = 5(c, d, fc max , i) > 0 so 
that, for all n sufficiently large, 

P[Tnear,i < en 3 \W 0 = w] < 1 - 8, 

P[niear,i < (pen\W 0 = w] < 1 - 8 
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uniformly in 1 < k < k max and w E ilk fl Q X) . 

Proof. This follows from Lemma 7.4 in essentially the same way that Corollary 7 followed 
from Lemma 7.3. □ 

7.2. Comparing KCIP with the Coalescence process. Recall k max from formula (7.1), 
the set ilk from formula (2.1) and the sets } from formula (7.10). In this section, we obtain 
bounds on the occupation measure of X t in ilk, uniformly in k<k mSLX . Recall the definition 
of the exit times Lk(x ) from Equation (2.2). Our intermediate steps are to show, uniformly 
in 1 < fc < k m ax , 

(1) Uniformly in x E ilk, P[-Lfc(x) < en 3 ] < 1 — 5 < 1 for some e = e(c,d,k max ) and 

3 3(c, d, fcmax)' 

(2) Uniformly in x E ilk, ^[Lk{x) > e - 1 n 3 ] <1 — 5. 

(3) Uniformly in x E il k , P[Et=o 1 x t en k < §e n 3 ] <1-5. 

(4) Uniformly in x E il k , P[X Lj . (Xo) E O fc _i|X 0 = x] > 5,P[X Lfc(Xo ) e O fc+ i|X 0 = x] > 5. 
We begin by proving item (4): 

Lemma 7.5. There exist e = e(c, d, k max ) > 0, 5 = 5(c, d, k max ) > 0 and a = a(c, d, fc max ) > 0 
so that, uniformly in 1 < k < k ms , x and x E ilk, we have 

en 3 

FIT, 1 x t en k > an 3 \X 0 = x] > 5. 

t=o 

Proof. Assume first that x E il k 0 . We couple {W}teN to a coalesence process 

using the same coupling as in Lemma 6.15 and define {W t } te N as in formula (7.6); since 
X 0 E ilk, we have r s ^ rt = 0 for all 1 < i < k. Recalling the definition of from 

formula (7.7), we fix T E N and consider the two events: 

(1) The coalescence process has T near ,2 < 4>t- 

(2) A 2 ,t- The KCIP has infjf : 5 t > 1} < d T . 

By Lemma 7.4, there exists some ei,y > 0 so that 

P[A,en] < 1 - 7 + 0 ( 1 ) (7.17) 

for 0 < e < ei- By Corollary 5, there exists some 62 so that for 0 < e < 62 , 

P[A-] < | + o(l). (7.18) 

Choose e = | min(e 1 , e 2 ) and fix a, (3 > 0. Set A en = Ai t£n U ^4 2 ,en- By inequalities (7.17) 
and (7.18), 

fin 3 P n 3 

lx t^ k - an3 \ X 0 = X ]> P E - an *\ X ° = X 1 AenMAen) 

t=0 t=0 

pn 3 

> > an 3 \X 0 = x, A c en ] + o(l). 

t =0 
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Conditionally on , fa + \ — fa has geometric distribution with mean between 
Thus, for all 3 sufficiently large, 

P[/TV < fa n < fin 3 } = 1 - 0 ( 1 ). (7.19) 

Since P[*4(; n ] > % + o(l) is bounded away from 0, this implies that, for /3 sufficiently large, 

fin 3 <t> en 

P[E l.Y t eo fc > an 3 |X 0 = x] > ^P[E lx t en fc > an 3 \X 0 = x, A c en ] + o(l). (7.20) 

t =0 t=0 

By checking allowed sequences of update variables {tq,p s } se N, if V t < k max and S t = 0, we 
have 

‘Zcdk 

P[<5 m = 1| 6 t = 0, V t < k max , t < fa n , A c en ] < P[<5t+i = 1| 6 t = 0, V t < fc max ] < ^ max , 
while if V t < fc max + 1 and S t — 1, 

P[^+l — 01<5* = l,Vt < fcmax + 1) t < (j) e ni A° e fa\ 

T P[^t+i = 0|5t = 1, V t < fc max + 1] 

-h (x+o[l)) ' 

Since the latter rate is @(n) times larger than the former, we have 

<Pen 

P[E 1 ^en fe > afa n \X 0 = x,A c en ] = 1 + o(l) 

t= o 

for all a~ l sufficiently large. Combining this with inequalities (7.19) and (7.20), we have for 
P, a sufficiently large that 

fin 3 

P[E - a P~ ln3 \ x o = x] > | + 0(1). 

t =0 

This completes the proof of the Lemma when x E O*. D Q X) . 

(n) 

We give a short argument reducing the case x E ilk to the case x E Ofcf, it is essentially 
the same argument given in Corollary 7. Fix x E ■ There exists x' E ilk H and 

a sequence of configurations T = (x = x 0 ,x 1} ... ,X£ = x') so that \x i+ i — x { \ < 1 and 
i < 16/c^ ax . The same argument as given in inequality (7.15) implies that there exists some 
fa = 7 '(c, d, fc max ) > 0 so that 

P[inf{£ : X t = x'} < n 2 log(n)|X 0 = x\ > fa. 

This bound reduces the case x E ilk to the case x E ilk D Q-fa , at the cost only of replacing 7 
with 77 ' and replacing en 3 with en 3 + n 2 log(n) = (1 + o(l))en 3 . This finishes the proof. □ 

Define 

p k = inf{t : X t E D fc }. 
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(7.21) 




(7.22) 


Lemma 7.6. For all 1 < k < fc max; there exists some S, e > 0 so that 

P[p fc -i < en 3 |X 0 = x\ > 6, 

F[p k+1 < en 3 \X 0 = x\ > S 
holds uniformly in x E kl k (where the first part of the inequality obviously requires k > 2). 

Proof. As argued in Corollary 7 (and again in the last paragraph of the proof of Lemma 7.5), 
we can assume without loss of generality that x E Q k fl , since for any fixed e > 0, the 
ratio 

_inC-g^ P[pfc_i < en 3 \X 0 = x] _ 

inf xen fc nei n) 1 < en 3 (! + o(l))l^o = A 

is bounded away from 0 uniformly in n. Thus, it is sufficient to consider only starting 
positions X 0 E Q k fi ■ 

Fix 1 < k < k mSLX and X 0 — x E tt k ft . We now couple {W}teN to a coalesence 
process {Zi} teN using the same coupling as in Lemma 6.15 and define {lW} te H according to 
formula (7.6). Since X 0 6 Q k , we have Tg(.( rt = 0 for all 1 < i < k. Dehne the sequence of 
movement times {fij}j> o as in formula (7.7) and the graph X as in formula (7.14). Fix e > 0; 
we will dehne T to be the collection of paths through the graph X for which the associated 
coalescence process has r nearj 4 < 0 en . More precisely, we say that a path V = (u>o, w\,..., w m ) 
through the graph X is in T if it satisfies: 

(1) m < en. 

( 2 ) |wj+i — vj t | = 1 for all 0 < i < m, where distance is measured according to the graph 
distance on X. 

(3) w 0 = x. 

(4) There exist u,v E G so that | u — v\ = A and w m [u\ = w m [v\ = 1. 

(5) For all u,v E G with \u — v\ = A and all 0 < i < m, we have Wi[u\iUi[v] = 0. 

For any path T G T of length m and any A > 0, we have by essentially the same calculation 
as in Lemma 4.1 that 

P[ sup S t < l|{0 m > An 3 }, {W ^}™o = T] > g(A) + o(l) 

0^t^Tnear,4 

for some function g(A) > 0. Combining this with inequality (7.19), we have for all A so that 
h is sufficiently large that 

P[{ sup S t < 1 } n ( 0 m < A7i 3 }\{W <Pl }? =0 = r] > g{A) + o(l). 

0^t^T ne ar,4 

In particular, there exist e, A, B > 0 so that 

P[{ sup S t <l}n{fi m <A7i 3 }\{W rPl }Zo = A>B + o(l). (7.23) 

0^t^T n ear,4 

We choose such a triple e, A, B for the remainder of the proof of the first half of inequality 
(7.22). Dehne p' k _ 1 = inf{t : Y t — k — 1}. By the arguments in Proposition 6.16, there 
exists y > 0 so that 

rWt-i < 4n 2 ' 5 |A'o = y]> V(1 + o(l)) 
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holds uniformly in y G ■ By the same reduction argument used in Corollary 7, this 

implies that there exists some 7 > 0 so that 

P[Pfc-i < 4n 2 ' 5 |X 0 — y]> 7(1 + o(l)) 

holds uniformly in y G Gtk\G^ ■ Recall the definition of the update variables ( pt,v t ) from 
Equation (1.1). Denote the event 

Xn(t) = {Vs : t <s <t + n log(n) 2 , p s > - or V" X s [u] = 0}. (7.24) 

n ' 

( u,v s )eE(A(L,d)) 

We then have 

P[p fc _i < 4n 2 ’ 5 + n log(n) 2 |X 0 = y\ (7.25) 

An 2 - 5 

> 1 + nlog(n) 2 |pfc_i = t]P[pjfe_i = t] 
t =0 

.. 4n 2 - 5 

> 5 E p[{uS 1o ‘ ( ” )2 {«.} = A(i, d)} n x„(t)bLi = = *] 

^ t =0 

4n 25 

>^(i+t'(i))E p W-i = f ] 

t=0 

> 1(1 + 0 ( 1 )). 

Combining this with inequality (7.23) and the observation that w m G for all T = 

(wq, ..., u> m ) G \h, we have 

P[p fc _i < An 3 + n 2 ' 5 + nlog(n) 2 |{lE ( j i .}"L 0 = T] > ^ + o(l). 

This implies that 

P[pfc-i < An 3 + n 2 " 5 + nlog(n) 2 |X 0 = x] 

> V P( Pl _, < +n 3 + „ 2 - 5 + nlog( n ) 2 |{WV }™ 0 = r]P[{W^J ” 0 = r|A'o = x] 
ref 

> = r|x„ = z] + 0(1) 

ref 

B'y 

= —P[r nearj4 < (j) en \X 0 = x} + o(l) 

B'y 

> — P[r col < (f> en \W 0 = x] + o(l). 

The last quantity, P[r co i < (j) €n |Md) — x\, is bounded away from 0 by Theorem 5 of [Cox89]. 
This completes the proof of the first half of inequality (7.22). 

We now prove the second half of inequality (7.22). Fix x G G^‘ } and e > 0. We consider a 
new collection of paths T on X. Roughly speaking, these will be the paths for which there 
are no near-collisions for many steps. More precisely, we say that T = (x — wo,w\,... ,w m ) 

is in T if 
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• m > \en. 

• \wi + i — Wi\ — 1 for all 0 < i < m. 

• Wo = x. 

• For all u,v G G with \u — v\ <4 and all 0 < i < m, we have Wi[u]wi[v] = 0. 

Recall that rj£ oupling is defined in formula (6.43) and Ctripie is defined in formula (6.41). 
If o = r G v h, then no near-collisions of W t have occurred by time </> m , and so 

conditioned on this event we have 


i™i P Kng > m in(^ m , mnOJe). 


"(j) 


Also, for any t 6 N and any starting point Ao = x G 0*, with = 1, we have 

P[ min S s > 1|A 0 = x] > (1 — —)*. 

o<s<t n 


(7.26) 


(7.27) 


In particular, the indicator function of the event {min 0 <, g < it S s > 1 } is stochastically dom¬ 
inated by a geometric random variable with mean Combining inequalities (7.26) and 
(7.27), we have 


p[{£i 


t =0 


*>i > ^ n 2 } U {nrm (triple < <t>m}\ {W^}™ 0 = T] = 1 - o(l). 


Noting that 


[mm fe Ctri P ie = t + l\{W 4>i }?L 0 = T,S t = 1,™ (triple > W < 4>i\ > ^ 


hi) 


we obtain 


4>n 


[mm (triU. > <Pm\{W^}T =0 = r, x *>i ^ 32 n2 ] ^ ( X - ^2) 


c 


< e 32 . 

Combining this with inequality (7.28) and rearranging terms, this implies 
P[min C ®, 8 < 0 m |{^jr=o = r] 


(7.28) 


> 1 - o(l) - P[ min (Sl, le > (f>m\{W^}? = o = T, ^ l 5t >i > —n 2 ] 

—t=o 


> 1 — e 32 — o(l). 

Summing over F 6 A. 

.ndn fc (t ( ri P ie < min( 0 m , r nearj4 )] > P [ min_ (^ < 0 m|{W / 0 I }”L o = r ] p [{W&}£o = r ] 


rex 


1 <j<k 


> 


(1 _ e-S - „(!)) £> 10 ^}”, = r] 


rex 


> (1 - e 32 - o(l))P[r n ear,4 > (>§]• 
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Write Ctripic = m ^ n i<i<fc Ctripie- By Corollary 8 , we conclude that there exists some 7 > 0 so 
that 


PfCtopS < min (0m, r ne ar,4)] > 7 + o(l). 

Combining this with the calculation in inequality (7.25), 

F[p k+1 < <% + n log(n) 2 |X 0 = y\ 

..(min) | i / \2 

> P[{U“SJ ° S n R} = A {L, d)} n Xn (Ctriple ) I Ctriple < min (0f n, W 7 ), W 0 = y\ 

^triple 

X PfCtripk < min(0e„,T n ear,4)|A:o = y] 

>(l-o(l))(7-o(l)). 


Combining this with inequality (7.19) completes the proof. 


□ 


8. Proof of Theorem 2 


In this section, we find bounds on the occupation times of Q k and use these bounds to 
finish the proof of Theorem 2. Recall from (3.2) that the quantity K k applied to X t is 

T 

K k (T) = ^ l.Y t eOfc- 

t =0 

Lemma 8.1 (Bound on Occupation Measures). There exists 0 < 7 < 00 so that, for all A 
sufficiently large and all B > 0, 

£ 

F[n k (An 3 log(n)) < Bn 3 log(n)] < 7 — + o(l) 

L 

uniformly in X 0 — x G and 1 < k < fc max . 


Proof. Recall the set /C from (7.4) and /c max from (7.1). By Corollary 6 , there exists <7 > 0 
so that for all A sufficiently large, 

An 3 log(n) 

P[ ^ 1 x t eic > c 2 An 3 log(n)|X 0 = x e fi] = 1 + o(l). (8.1) 

t =0 

Recall the definition of the update variables ( pt,v t ) from formula (1.1) and let Xn = Xn(0) 
from formula (7.24). For any x G /C, we have that 


P 


inf{t >0 : X t e /C fl A k Q k } < nlog(n) 2 |X 0 = x € K 


> P 


{uS (n)2 {uJ = A (L, d)} O Xn\Xo = xeic 
= 1 + 0(1). (8.2) 

Combining inequalities (8.1) and (8.2) with Lemma 7.5, there exists some C 2 > 0 and 1 < 
k' < fc mav so that, for all A sufficiently large, 

An 3 log(n) 

p [ > C 2 An 3 log(n)|Xo = x E fl] = 1 + o(l). (8.3) 


t =0 
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(8.4) 


Inequality (8.3) implies the existence of some C 3 > 0 so that, for any X 0 — x 6 fl, 

An 3 log(n) 

E[ Y 1 A' t GO fc ,] > c 3 7ln 3 log(n) 

t=0 

for all A sufficiently large. We claim that inequality (8.4) also holds with k' replaced by 
k! + 1 and also (if k! > 2) with k! — 1. By Lemmas 7.6 and 7.5, there exists some C 4 > 0 so 
that 


An 3 log(n) An 3 log(n) 

E[ E > c 4 E[ Y 1 x t en k ,} > c 4 c 3 An 3 log(n), 

t=0 t=0 

An 3 log(n) An 3 log(n) 

E[ E Ijqe n fc ,_J > c 4 E[ ^enj > c 4 c 3 dn 3 log(n). 

t=0 t=0 


(8.5) 


This is exactly inequality (8.4) with k' replaced by k' + 1 and k' — 1 respectively. Since the 
set {1, 2,..., fc max } is finite, the argument between inequalities (8.4) and (8.5) implies that 
there exists some c 5 > 0 so that 

An 3 log(n) 

E[ Y 1 x t en j ] > c 5 An 3 log(n) ( 8 . 6 ) 

t .=o 


for all 1 < j < fc max . Since Kk(T ) = Yt=o lx t en fc , the result now follows from an application 
of Markov’s inequality to inequality ( 8 . 6 ). □ 


Finally, we prove our main result: 


Proof of Theorem 2. The lower bound in Theorem 2 is given in Theorem 3. We now show 
the proof of the upper bound in Theorem 2 by applying the bounds in Lemma 5.1 and 
Lemma 8.1 to Lemma 3.1. 

Lemma 8.1 implies that there exist constants 0 < A, B < oo, and a function N = 
N(A, B , c, d) so that for all k < fc max (see formula (7.1)) and all n > N, 

supP[«: fc (An 3 log(n)) < Bn 3 \og(n)\X 0 = x] < —. (8.7) 

xen 16 

Similarly, Lemma 5.1 implies that there exists a constant 0 < C\ < oo so that 

max T ntk < Crn 2+ ^ log(n). (8.8) 

l^fc^fcmax 

In the notation of Lemma 3.1, let 0 = f2, let 0*, = for 1 < k < | and let 0» + i = 

n_ 

Set I = { 1,2,..., k max }. By Theorem 6.1, if the KCIP is started at the stationary 
distribution 7r, there exists 0 < a, e < 1 so that 

E[W] = E[K™ 3 ] < (1 - a)E(Vb) + C G . 

Thus we get E(V r 0 ) < ^ ^ f Markov’s inequality yields that 

e[Vo < fc max ] > i 
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Since Vo is distributed according to tt, this immediately yields 

7r(U fce/ fi fc ) > j - o(l). 

Thus, in the notation of Lemma 3.1, we can fix for all n > N — N(c,d,A,B ) sufficiently 
large, 

3 H 1 

°=8’ ^ ~ 16’ 7 = IP 

t = 16c' 1 Cin 2+ ^ log(n) < -B?r 3 log(n), 

T = \An 3 log(n)] 


where C\ is the constant from (8.8). By (8.8), we have 


max —— 1 

l^fc^fcmax t 

By (8.7), we have that for 1 < k < fc max , 

sup F[n k (T) < t\X 0 


1 

16 

(8.9) 

_i 

I T-1 

VI 

TL 

(8.10) 


Combining inequalities (8.9) and (8.10) immediately implies that, in the notation of Lemma 
3.1, T < T = 0(n 3 log(n)). Thus by Lemma 3.1 we have that r m i X = 0(n 3 log(n)) and the 
proof of Theorem 2 is complete. □ 


9. Conclusion and future work 

We have resolved only some special cases in Aldous’ conjecture. In particular, we do not 
have any results for: 

• Graphs other than the torus. 

• Density regimes other than p = 

In this section, we give conjectures for other graphs and regimes, and mention cases for 
which our methods work well. Before doing so, there is the question as to whether the 
adjustment to Aldous’ conjecture required by Theorem 3 is essentially the only required 
correction. If so, this would suggest the conjecture: 

Conjecture 9. Fix d G N and let G be a d-regular graph. The mixing time r m ; x of the KCIP 
with parameter p on graph G is 0(p~ 1 d\G\r^ + J^j) as |Gj goes to infinity, where is 
the mixing time of the \-lazy simple random walk on the graph G. 

We believe that this gives the correct answer for the torus in dimension d > 3 (that is, we 
conjecture that the correct mixing time for the process described in Theorem 2 is 0(n 3 )). 
However, we do not believe that Conjecture 9 is true in general. In particular, define the 
coalescence time Tc oa i of a graph to be the expected time for the coalescence process on G to 
go from |Cr| particles to 1 particle. We suspect that the mixing time of the KCIP is bounded 
from below by the coalesence time of the associated coalesence process on the same graph. 
By [Cox89], this suggests that the mixing time for the torus in dimension d = 2 is at least 
n 3 log(n), while Conjectures 1 and 9 both suggest the mixing time is 0(n 3 ). In general, we 
conjecture that: 
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Conjecture 10. Fix d £ N and let G be a d-regular graph. The mixing time r m ; Y of the 
KCIP with parameter p on graph G is 0(p~ 1 d\G\rf^f + + p _1 d|G|rc 0 ai) os |Gj goes to 

infinity, where is the mixing time of the \-lazy simple random walk on the graph G and 
Tcoai 'is the coalesence time of the graph G. 

Next, we discuss when our strategy outlined in Section 2.2 may be applicable for proving 
Conjecture 10. When restricted to densities in the regime p = -, our proof strategies 
are likely to work well for many other sequences of bounded-degree graphs. In particular, 
for random triangle-free d-regular graphs, our argument goes through with only two major 
changes. The first change is to replace all bounds on the coalesence process from [Cox89] 
with analogous bounds from [01il2b] and [01il3]. These bounds are substantially looser, 
but strong enough for our arguments to go through. Next, random d-regular graphs are 
expanders with high probability (see, e.g., [HLW06]), and in particular have spectral gaps 
that are uniformly bounded below in n and mixing times that grow like log(n). This allows us 
to make the second change, replacing the bound on the log-Sobolev constant from [Yau97] 
with a bound on the spectral gap from [HLW06]. Besides the invocation of [Yau97], all 
comparison arguments for the log-Sobolev constant of A(L, d) given in Section 5 also apply 
as written to bounding the spectral gap of general d-regular graphs. These graphs have such 
large spectral gaps that the bounds obtained this way are sufficient. Beyond expanders, we 
generally expect our strategy to succeed for families of bounded-degree graphs with 

max(T C oal) Tmix) = 0(n 3 ). (9.1) 

In particular a similar approach works for the lattice in d > 3 dimensions. When (9.1) fails 
to hold, as with A(L, d) in dimension d = 2, the arguments in Section 6 must be substantially 
changed. 

For sequences of m = m(n)-regular graphs (with m{n ) very slowly growing with n), our 
strategy could be refined to give nontrivial bounds, but our results are not very useful as 
written. For graphs with very large degrees d ~ n, a straightforward comparison argument to 
the KCIP on the complete graph analogous to that given in [DSC93] for the simple exclusion 
process gives useful bounds. 

Our strategies are unlikely to work well for p = p n ^ -, as the arguments this paper rely 
quite strongly on the stationary measure of the constrained Ising process being concentrated 
on configurations with few particle. Despite technical difficulties, we believe that theO(n 3 ) 
bound will hold up to p n ~ n~a~d. Again, for p sufficiently large, a straightforward com¬ 
parison argument to the constrained Ising process on the complete graph analogous to that 
given in [DSC93] for the simple exclusion process gives useful bounds. 
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